[R] list of matrices

arun smartpink111 at yahoo.com
Thu Jan 3 06:33:30 CET 2013


HI Eliza,
I looked into your data.
It's not a list of matrices, but a "list of data.frames" and as suspected, some of the columns were "chr" instead of "num".
str(d) #only selected list elements which showed the anomaly
$ :'data.frame':    1998 obs. of  13 variables:
  ..$ Col0 : num [1:1998] 1 2 3 4 5 6 7 8 9 10 ...
  ..$ Col1 : num [1:1998] 396 396 396 396 396 ...
  ..$ Col2 : num [1:1998] 379 371 371 371 371 ...
  ..$ Col3 : chr [1:1998] "353.75" "354.0" "345.0" "345.0" ...
  ..$ Col4 : num [1:1998] 354 354 362 362 362 ...
  ..$ Col5 : num [1:1998] 492 447 561 527 527 ...
  ..$ Col6 : chr [1:1998] "1154.64" "1334.0" "1002.0" "849.0" ...
  ..$ Col7 : num [1:1998] 3283 2888 2648 2662 2803 ...
  ..$ Col8 : num [1:1998] 5603 5663 5607 5578 5635 ...
  ..$ Col9 : num [1:1998] 3594 3313 2973 2662 2662 ...
  ..$ Col10: num [1:1998] 1053 1019 1002 951 917 ...
  ..$ Col11: num [1:1998] 515 501 493 481 479 470 462 459 447 447 ...
  ..$ Col12: num [1:1998] 405 402 396 396 388 379 379 377 371 365 ...
 $ :'data.frame':    1998 obs. of  13 variables:
  ..$ Col0 : num [1:1998] 1 2 3 4 5 6 7 8 9 10 ...
  ..$ Col1 : num [1:1998] -1 -1 -1 -1 -1 -1 -1 -1 -1 -1 ...
  ..$ Col2 : num [1:1998] -1 -1 -1 -1 -1 -1 -1 -1 -1 -1 ...
  ..$ Col3 : chr [1:1998] "-1.0" "-1.0" "-1.0" "-1.0" ...
  ..$ Col4 : num [1:1998] -1 -1 -1 -1 -1 -1 -1 -1 -1 -1 ...
  ..$ Col5 : chr [1:1998] "-1.0" "-1.0" "-1.0" "-1.0" ...
  ..$ Col6 : chr [1:1998] "-1.0" "-1.0" "-1.0" "-1.0" ...
  ..$ Col7 : chr [1:1998] "-1.0" "-1.0" "-1.0" "-1.0" ...
  ..$ Col8 : num [1:1998] -1 -1 -1 -1 -1 -1 -1 -1 -1 -1 ...
  ..$ Col9 : num [1:1998] 6311 6452 6396 6169 5717 ...
  ..$ Col10: num [1:1998] 1695 1610 1568 1506 1407 ...
  ..$ Col11: num [1:1998] 645 640 634 628 617 ...
  ..$ Col12: num [1:1998] 470 464 461 456 453 ...
 
 $ :'data.frame':    1998 obs. of  13 variables:
  ..$ Col0 : num [1:1998] 1 2 3 4 5 6 7 8 9 10 ...
  ..$ Col1 : num [1:1998] 29.4 29.4 28.9 29.7 30.6 ...
  ..$ Col2 : num [1:1998] 29.1 29.4 28.9 28 28 27.5 27.5 27.5 27.5 27.5 ...
  ..$ Col3 : num [1:1998] 30.6 31.1 30.3 30.3 31.1 30.3 30.6 30.6 29.7 31.1 ...
  ..$ Col4 : num [1:1998] 27.1 26.6 24.8 22.1 26.6 26.6 26.6 26.6 26.6 26.6 ...
  ..$ Col5 : num [1:1998] 45.3 46.4 46.4 47.5 50.4 51.5 53.2 53.2 54.3 55.5 ...
  ..$ Col6 : num [1:1998] 137 145 150 161 168 172 180 183 187 220 ...
  ..$ Col7 : chr [1:1998] "225.834" "211.684" "194.704" "186.78" ...
  ..$ Col8 : num [1:1998] 337 504 583 405 376 ...
  ..$ Col9 : num [1:1998] 388 359 359 357 334 ...
  ..$ Col10: num [1:1998] 129 126 122 111 109 103 100 97.1 93.7 91.1 ...
  ..$ Col11: num [1:1998] 43.6 38.5 40.2 47.5 44.1 41.9 39.1 38.5 36.8 36.2 ...
  ..$ Col12: num [1:1998] 28.3 27.1 26.1 26.6 26.6 25.7 25.2 26.6 26.6 26.1 ...

When I checked the data more closely in sheet 11 (11, 12, 14 with anomaly), found that in those chr columns, some numbers are followed by space and then again 1 in the same cell.  For eg. (326.5 1).

d1<-lapply(d,function(x) {do.call(data.frame,ifelse(grepl("\\d+\\s+\\d+",x),as.data.frame(apply(x,2,function(x) as.numeric(gsub("[ ]","",x)))), x))})
d2<-lapply(d1,function(x) {names(x)<- paste("Col",0:12,sep=""); x})
 #lapply(d2,function(x) sapply(x,is.numeric)) #can check whether all the columns of list elements are numeric.
# str(d2)

e<-lapply(seq_along(d2), function(i) {d2[[i]][apply(d2[[i]],1,function(x)any(!is.na(x))),]})
f<-lapply(seq_along(e), function(i) {e[[i]][apply(e[[i]],2,function(x)any(!is.na(x))),]}) #not sure why you need these two steps. Instead 

#f1<-lapply(d2,na.omit)

r<- lapply(f, function(x){replace(x, x == -1, NA)})

# r1<- lapply(f1, function(x){replace(x, x == -1, NA)})

 sr<-lapply(r,function(x) colMeans(x,na.rm=TRUE))

#sr1<-lapply(r1,function(x) colMeans(x,na.rm=TRUE))
names(sr)<-paste("sr",1:16,sep="")

#names(sr1)<-paste("sr",1:16,sep="")
res<-do.call(rbind,sr)

head(res,3)
#         Col0      Col1      Col2      Col3      Col4      Col5      Col6
#sr1  21.53977 445.82619 440.41702 519.98165 859.25831 2186.7239 5164.3338
#sr2  75.75148  38.60398  48.15517 103.37507 200.48586  290.2050  427.8965
#sr3 617.59607  76.16458  68.22891  67.89351  92.70717  190.4647  489.7183
 #        Col7      Col8      Col9      Col10     Col11     Col12
#sr1 7419.6656 6497.9376 2947.0133 1096.80798 678.91826 527.60351
#sr2  426.5273  297.5692  137.7833   72.22109  49.69003  43.67117
#sr3  800.9860  748.0768  371.0196  162.00639 110.73111  89.26638
 tail(res,2)
#          Col0      Col1      Col2     Col3      Col4     Col5      Col6
#sr15 615.98058  53.62532  49.89996 43.91249  44.45947  96.7349  462.8477
#sr16  21.53931 114.94487 102.43655 94.68324 126.93070 407.4416 1336.4197
#         Col7     Col8     Col9    Col10    Col11     Col12
#sr15 1258.418 1306.892 520.8200 157.4494  92.5196  67.39349
#sr16 2114.459 1855.358 859.8711 318.6484 183.9742 138.65578

 
#res1<-do.call(rbind,sr1) #results are a bit different
 #head(res1,2)
  #      Col0     Col1      Col2     Col3     Col4      Col5     Col6      Col7
#sr1 15.72368 458.3062 440.41702 518.4893 853.6490 2169.3466 5144.764 7423.5169
#sr2 15.91228  39.8461  48.15517 103.2362 200.2069  289.7981  427.628  426.8419
  #       Col8     Col9      Col10     Col11     Col12
#sr1 6541.7739 2964.951 1101.95443 679.90060 527.97005
#sr2  298.3639  137.985   72.27442  49.71482  43.71521
> 


                                         

Hope this helps.
A.K.






----- Original Message -----
From: eliza botto <eliza_botto at hotmail.com>
To: "r-help at r-project.org" <r-help at r-project.org>
Cc: 
Sent: Wednesday, January 2, 2013 5:16 PM
Subject: [R] list of matrices


dear useRs,
i have a list containing 16 matrices. i want to calculate  the column mean of each of them.
i tried
>sr <- lapply(s,function(x) colMeans(x, na.rm=TRUE))
but i am getting the following error
>Error in colMeans(x, na.rm = TRUE) : 'x' must be numeric
can it be done in any other way? and why i am getting this error??
thanks in advance..
elisa                           
    [[alternative HTML version deleted]]

______________________________________________
R-help at r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.





More information about the R-help mailing list