[R] About data manipulation
chocold12 at gmail.com
Sun Nov 27 08:03:30 CET 2016
Thanks Jim, this method is very convenient and is what I want. Could I know
how to save the resulted dataframe? It printed in the console directly.
On Sat, Nov 26, 2016 at 5:55 PM, jim holtman <jholtman at gmail.com> wrote:
> You did not provide any data, but I will take a stab at it using the
> "dplyr" package
> DT %>%
> group_by(month, id, note) %>%
> summarise(avg = mean(total))
> Jim Holtman
> Data Munger Guru
> What is the problem that you are trying to solve?
> Tell me what you want to do, not how you want to do it.
> On Sat, Nov 26, 2016 at 11:11 AM, lily li <chocold12 at gmail.com> wrote:
>> Hi R users,
>> I'm trying to manipulate a dataframe and have some difficulties.
>> The original dataset is like this:
>> year month total id note
>> 2000 1 98 GA 1
>> 2001 1 100 GA 1
>> 2002 2 99 GA 1
>> 2002 2 80 GB 1
>> 2012 1 78 GA 2
>> The structure is like this: when year is between 2000-2005, note is 1;
>> year is between 2006-2010, note is 2; GA, GB, etc represent different
>> groups, but they all have years 2000-2005, 2006-2010, 2011-2015.
>> I want to calculate one average value for each month in each time slice.
>> For example, between 2000-2005, when note is 1, for GA, there is one value
>> in month 1, one value in month 2, etc; for GB, there is one value in month
>> 1, one value in month 2, between this time period. So later, there is no
>> 'year' column, but other columns.
>> I tried the script: DF_GA = aggregate(total~year+month,data=subset(DF,
>> id==GA¬e==1)), but it did not give me the ideal dataframe. How to do
>> Thanks for your help.
>> [[alternative HTML version deleted]]
>> R-help at r-project.org mailing list -- To UNSUBSCRIBE and more, see
>> PLEASE do read the posting guide http://www.R-project.org/posti
>> and provide commented, minimal, self-contained, reproducible code.
[[alternative HTML version deleted]]
More information about the R-help