[R] calculate column means when missing data is present

Denver XU financialengine at gmail.com
Tue Nov 27 02:21:31 CET 2007


colMeans(X,na.rm=T)
?colMeans




2007/11/27, Luke Neraas <lukasneraas.r at gmail.com>:
> # Hello,
>
> # I have a matrix that contains some missing values denoted by NA.
> # I would like to calculate the mean for each column and not have the
> # NA values included.  here is a sample matrix  "X" below.
>
>
> c1 <- c(1,4,3,2,NA,2,5,6,4,2)
> c2 <- c(1,3,6,NA,NA,2,9,6,1,2)
> c3 <- c(2,4,2,NA,4,2,NA,6,3,5)
> c4 <- c(2,6,4,NA,4,2,NA,6,1,5)
> c5 <- c(2,3,2,NA,4,2,9,6,NA,5)
>
> X<-cbind(c1,c2,c3,c4,c5)
>
> X
>
> # I have tried a variety of methods to calculate the Column means but each time
> # i do not get the result i am looking for
>
> result<-colMeans(X)  # only returns a mean of NA
>
>
> X <- X[!is.na(X)]    # Effectively removes the NA values but now I
> cannot calculate Column Means
>
> #  Below is the result of the column means I am looking for
>
> result
>        c1   c2    c3   c4    c5
>  [1,] 3.22 3.75  3.5  3.75 4.125
>
>
> # any help or ideas would be greatly appreciated
>
> # thanks in advance
>
> Luke Neraas
>
> lukasneraas.r at gmail.com
>
> University of Alaska Fairbanks
> School of Fisheries and Ocean Sciences
> 11120 Glacier Highway
> UAF Fisheries Division
> Juneau, AK 99801
>
> ______________________________________________
> R-help at r-project.org mailing list
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
> and provide commented, minimal, self-contained, reproducible code.
>



More information about the R-help mailing list