[R] histogram

arun smartpink111 at yahoo.com
Mon Mar 4 21:26:44 CET 2013


Hi,

dat1<- read.csv("rightest.csv",sep=",",header=TRUE,check.names=FALSE)
 dat2<- as.dist(dat1[,-1],upper=F,diag=F)
vec1<- as.vector(dat2)
label1=c("0-25","25-50","50-75")
Name1<-unlist(lapply(0:123,function(i) rep(i+1,i)))
dat3<-data.frame(Name1,vec1)
res<-t(aggregate(.~Name1,data=dat3,function(x) table(cut(x,breaks=seq(0,75,25),labels=label1))))
colnames(res)<- res[1,]
res1<- res[-1,]
row.names(res1)<-gsub("vec1.","",row.names(res1))
res1
Names2<-apply(res1,1,function(x) paste(which(x!=0),collapse=","))
res2<- data.frame(Frequency=apply(res1,1,function(x) sum(1*(x!=0))), stations=Names2,stringsAsFactors=FALSE)

res2
#      Frequency
#0-25        121
#25-50       122
#50-75        76
                                                                                                                                                                                                                                                                                                                                                                                           #stations
#0-25    #1,3,5,6,7,8,9,10,11,12,13,14,15,16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31,32,33,34,35,36,37,38,39,40,41,42,43,44,45,46,47,48,49,50,51,52,53,54,55,56,57,58,59,60,61,62,63,64,65,66,67,68,69,70,71,72,73,74,75,76,77,78,79,80,81,82,83,84,85,86,87,88,89,90,91,92,93,94,95,96,97,98,99,100,101,102,103,104,105,106,107,108,109,110,111,112,113,114,115,116,117,118,119,120,121,122,123
#25-50 #2,3,4,5,6,7,8,9,10,11,12,13,14,15,16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31,32,33,34,35,36,37,38,39,40,41,42,43,44,45,46,47,48,49,50,51,52,53,54,55,56,57,58,59,60,61,62,63,64,65,66,67,68,69,70,71,72,73,74,75,76,77,78,79,80,81,82,83,84,85,86,87,88,89,90,91,92,93,94,95,96,97,98,99,100,101,102,103,104,105,106,107,108,109,110,111,112,113,114,115,116,117,118,119,120,121,122,123
#50-75                                                                                                                                     #10,16,22,25,27,30,31,33,34,35,36,37,38,39,40,41,47,48,50,53,56,58,59,61,64,65,68,69,70,73,75,76,77,78,79,80,81,82,84,85,86,87,88,89,90,91,92,93,94,95,96,97,98,99,100,101,102,103,104,105,106,107,108,109,110,111,112,113,114,115,116,117,118,119,121,123

A.K.

________________________________
From: eliza botto <eliza_botto at hotmail.com>
To: "smartpink111 at yahoo.com" <smartpink111 at yahoo.com> 
Sent: Monday, March 4, 2013 3:21 PM
Subject: RE: histogram



Dear Arun,


Thanks for replying....
Although codes well defined my problem but the table in the end should look like the following

its just an imaginary table.....
Range                          stations                                         Frequency
0-25                           1,2,3,8,9                                           5
25-50                          4,10,11,100                                         4
50-75                          55,56,57                                            3
Where the "station" column shows the stations where distance of station is between the corresponding range.... like 1,2,3,8,9 have the distance between 0-25

i hope you wont mind

elisa




> Date: Mon, 4 Mar 2013 11:56:43 -0800
> From: smartpink111 at yahoo.com
> Subject: Re: histogram
> To: eliza_botto at hotmail.com
> CC: r-help at r-project.org
> 
> Hi Elisa,
> 
> I am not sure about the output you wanted.
> dat1<- read.csv("rightest.csv",sep=",",header=TRUE,check.names=FALSE)
>  dat2<- as.dist(dat1[,-1],upper=F,diag=F)
> vec1<- as.vector(dat2)
> label1=c("0-25","25-50","50-75")
> Count1<- as.data.frame(table(cut(vec1,breaks=seq(0,75,25),labels=label1))) #Overall count
>  Count1
> #   Var1 Freq
> #1  0-25 5465
> #2 25-50 1992
> #3 50-75  169
> 
> 
> Name1<-unlist(lapply(0:123,function(i) rep(i+1,i)))
>  length(Name1)
> #[1] 7626
> dat3<-data.frame(Name1,vec1)
> res<-t(aggregate(.~Name1,data=dat3,function(x) table(cut(x,breaks=seq(0,75,25),labels=label1))))
> colnames(res)<- res[1,]
>  res1<- res[-1,]
> row.names(res1)<-gsub("vec1.","",row.names(res1))
> res1
> #      2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28
> #0-25  1 0 2 0 2 3 2 1  1  1  3  1  1  3  2  3  6  3  5  2  4  8 13 21 21 23 20
> #25-50 0 2 1 4 3 3 5 7  8  8  8 11 12 11 13 12 11 15 14 18 17 12 10  3  2  3  6
> #50-75 0 0 0 0 0 0 0 0  0  1  0  0  0  0  0  1  0  0  0  0  0  2  0  0  2  0  1
> -----------------------------------------------------------------------------------------------------
> 
> A.K.
> 
> 
> 
> 
> 
> ________________________________
> From: eliza botto <eliza_botto at hotmail.com>
> To: "smartpink111 at yahoo.com" <smartpink111 at yahoo.com> 
> Sent: Monday, March 4, 2013 11:36 AM
> Subject: histogram
> 
> 
> 
> Dear Arun,
> 
> i have a distance matrix as attached in excel file with this email. You can read the data via R and 
> after reading the data i want you to extract the lower part of distance matrix by 
> as.dist(x, upper=F, diag=F). You will see that there 
> are 124 stations in my study. After that, i want to divide the data into three intervals 0-25, 25-75, 
> 75-100. Then i want to count the number of stations falling in each interval, which will be called 
> "Frequency". After that i want to draw the following table
> Range                                                         stations                                                                 Frequency
> 0-25                                                   names of station                                                      Number of stations
> 25-50      
> 50-75
> Finally, i want to draw histogram. i know i asked same kind of question before, but those commands are not working on distance matrix.
> 
> thankyou very very much in advance
> elisa



More information about the R-help mailing list