[R] conditional grouping of variables: ave or tapply or by or???

hadley wickham h.wickham at gmail.com
Fri Apr 24 01:50:32 CEST 2009


On Thu, Apr 23, 2009 at 5:11 PM, ozan bakis <ozanbakis at gmail.com> wrote:
> Dear R Users,
> I have the following data frame:
>
> v1 <- c(rep(10,3),rep(11,2))
> v2 <- sample(5:10, 5, replace = T)
> v3 <- c(0,1,2,0,2)
> df <- data.frame(v1,v2,v3)
>> df
>  v1 v2 v3
> 1 10  9  0
> 2 10  5  1
> 3 10  6  2
> 4 11  7  0
> 5 11  5  2
>
> I want to add a new column v4 such that its values are equal to the value
> of v2 conditional on v3=0 for each subgroup of v1. In the above example,
> the final result should be like
>
> df$v4 <- c(9,9,9,7,7)
>> df
>  v1 v2 v3 v4
> 1 10  9  0  9
> 2 10  5  1  9
> 3 10  6  2  9
> 4 11  7  0  7
> 5 11  5  2  7
>
>
> I tried the following commands without success.
>
> df$v4 <- ave(df$v2, df$v1, FUN=function(x) x[df$v3==0])
> tapply(df$v2, df$v1, FUN=function(x) x[df$v3==0])
> by(df$v2, df$v1, FUN=function(x) x[df$v3==0])
>
> Any help? Thanks in advance!

Here's one approach with the plyr package, http://had.co.nz/plyr

library(plyr)
ddply(df, .(v1), transform, v4 = v2[v3 == 0])

Hadley


-- 
http://had.co.nz/




More information about the R-help mailing list