[R] Creating data frame with residuals of a data frame

jim holtman jholtman at gmail.com
Wed Oct 26 18:25:46 CEST 2011


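Try this. The names of each residual vector are the row names of the
rows that lm() actually used, so they can be used to put the residuals
back into the matching rows of df, leaving NA everywhere else: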

> age<- c(5,6,10,14,16,NA,18)
> value1<- c(30,70,40,50,NA,NA,NA)
> value2<- c(2,4,1,4,4,4,4)
> df<- data.frame(age, value1, value2)
>
> #Run linear regression to adjust for age and get residuals:
>
> lm_f <- function(x) {
+     # regress one column on age; lm() drops rows with NA in x or age
+     residuals(lm(x ~ age, data = df))
+ }
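> resid <- apply(df, 2, lm_f)  # a list here, because the residual vectors differ in length
> resid <- resid[-1]           # drop the first element (age regressed on itself)
> for (i in names(resid)) {
+     newCol <- paste(i, 'res', sep = '')
+     df[[newCol]] <- NA  # initialize the new column
+     # names(resid[[i]]) are the row names (here the row numbers) that lm() kept
+     df[[newCol]][as.integer(names(resid[[i]]))] <- resid[[i]]
+ }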
> df
  age value1 value2  value1res   value2res
1   5     30      2 -16.945813 -0.37398374
2   6     70      4  22.906404  1.50406504
3  10     40      1  -7.684729 -1.98373984
4  14     50      4   1.724138  0.52845528
5  16     NA      4         NA  0.28455285
6  NA     NA      4         NA          NA
7  18     NA      4         NA  0.04065041
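
A possible alternative, offered only as a sketch and not part of the
session above: fitting with na.action = na.exclude makes residuals()
pad the dropped rows with NA, so each result has one entry per row of
df and can be assigned directly. The loop variable v, the use of
reformulate() to build the formula, and paste0() for the column names
are illustrative choices, not something from the original post.

```r
age    <- c(5, 6, 10, 14, 16, NA, 18)
value1 <- c(30, 70, 40, 50, NA, NA, NA)
value2 <- c(2, 4, 1, 4, 4, 4, 4)
df <- data.frame(age, value1, value2)

# Columns to adjust for age (everything except age itself).
vars <- setdiff(names(df), "age")

for (v in vars) {
  # reformulate("age", response = v) builds the formula  v ~ age.
  # na.exclude still drops incomplete rows for the fit, but residuals()
  # then returns NA for those rows instead of omitting them.
  fit <- lm(reformulate("age", response = v), data = df,
            na.action = na.exclude)
  df[[paste0(v, "res")]] <- unname(residuals(fit))
}

df
```

This should give the same value1res and value2res columns as the loop
above, without relying on apply() returning a list (it returns a
matrix when all residual vectors happen to have the same length).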


On Mon, Oct 24, 2011 at 10:23 AM, francesca casalino
<francy.casalino at gmail.com> wrote:
> Dear experts,
>
> I am trying to create a data frame from the residuals I get after
> having applied a linear regression to each column of a data frame, but
> I don't know how to create this data frame from the resulting list
> since the list has differing numbers of rows.
>
> So for example:
> age<- c(5,6,10,14,16,NA,18)
> value1<- c(30,70,40,50,NA,NA,NA)
> value2<- c(2,4,1,4,4,4,4)
> df<- data.frame(age, value1, value2)
>
> #Run linear regression to adjust for age and get residuals:
>
> lm_f <- function(x) {
> x<- residuals(lm(data=df, formula= x ~ age))
> }
> resid <- apply(df,2,lm_f)
> resid<- resid[-1]
>
> Then resid is a list with different row numbers:
>
> $value1
>         1          2          3          4
> -16.945813  22.906404  -7.684729   1.724138
>
> $value2
>          1           2           3           4           5           7
> -0.37398374  1.50406504 -1.98373984  0.52845528  0.28455285  0.04065041
>
> I am trying to get both the original variable and their residuals in
> the same data frame like this:
>
> age, value1, value2, resid_value1, resid_value2
>
> But when I try cbind or other operations I get an error message
> because they do not have the same number of rows. Can you please help
> me figure out how to solve this?
>
> Thank you.
>



-- 
Jim Holtman
Data Munger Guru

What is the problem that you are trying to solve?


