[R] how to simplify a data.frame and add the counts of duplicate rows as a new column

Simone Gabbriellini simone.gabbriellini at gmail.com
Wed Mar 2 16:34:14 CET 2011


many thanks, this is really a great solution!

best,
Simone

Il giorno 02/mar/2011, alle ore 16.22, Scott Chamberlain ha scritto:

> see package plyr, especially the function ddply(), eg.., in your case:
> 
> ddply(dataframe, .(columnA, columnB), summarise, 
> 	columnC = length(columnB)
> )
> 
> Scott
> On Wednesday, March 2, 2011 at 9:10 AM, Simone Gabbriellini wrote:
> 
>> Hello List,
>> 
>> I would like to simplify a data.frame like this
>> 
>> columnA	columnB
>> user10	 proj12
>> user10	 proj19
>> user10	 proj12
>> 
>> into something like:
>> 
>> columnA	columnB	columnC
>> user10	 proj12	 2
>> user10	 proj19	 1
>> 
>> I know unique() can simplify the data.frame, but how to count and store the duplicates?
>> 
>> thanks in advance for any help.
>> 
>> best regards,
>> Simone
>> 
>> ______________________________________________
>> R-help at r-project.org mailing list
>> https://stat.ethz.ch/mailman/listinfo/r-help
>> PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
>> and provide commented, minimal, self-contained, reproducible code.
> 



More information about the R-help mailing list