[R] little manipulation on data frame

Thomas Lumley tlumley at u.washington.edu
Fri Jun 6 19:06:55 CEST 2003


On Fri, 6 Jun 2003, N Dey wrote:

> Dear all,
>
> I have data like 3 coulmns and many rows. Each entry
> is less than 10.
>
> Example
> 	x	y	z
> 1	5	3	2
> 2	3	7	8
> 3	8	9	5
> 4	5	4	6
> --------------------------
> ---------------------------
>
> I have to sum entries of each coulmn (seperately) till
> it be 10. This i have to start for each row. And I
> want to assign no. of rows needed including that row
> too(it to be 10 or 10+, the moment it exceeds 10, i
> need to stop and count the no. of rows)in additional
> coulmns say N1 (corresponding to coulmn x), N2 (y) and
> N3 (z).
>
>
> I want my new table like
>
> 	x	y	z	N1	N2	N3
> 1	5	3	2	3	2	2
> 2	3	7	8	2	2	2
> 3	8	9	5	2	2	2
> 4	5	4	6	depends upon next row
>

It depends a bit on how many is `many'.

You can get cumulative sums with cumsum, and the first entry in each
column is then

    min(which(cumsum(x) > 10))

The i+1th entry is
    min(which( cumsum(x)-cumsum(x)[-(1:i)] > 10))

If the number of rows is not very large I would do

   sumx<-cumsum(x)
   N1<-min(which(cumsum(x) > 10))
   N1<-c(N1, sapply(1:(length(x)-1), function(i)
		min(which(sumx[-(1:i)]-sumx[i]>10))))

or the equivalent for() loop.

If the number of rows is very large it would be more efficient to rely on
the fact that no more than 10 rows are needed (assuming that zeros aren't
possible)

   n<-length(x)
   sumx<-cumsum(x)

   sumlags<-matrix(nrow=n,ncol=10)
   for(i in 0:9)
	sumlags[,i+1]<-sumx[ c((i+1):n, rep(n,i))]

   N1<-rowSums(sumlags < c(0,sumx[1:(n-1)])+10)+1


	-thomas




More information about the R-help mailing list