[R] Table rearranging

Jeffrey Joh johjeffrey at hotmail.com
Fri Feb 10 22:47:23 CET 2012


Hi David,
Totally forgot line 3!  So "513 red" should be ok but "420 red" and "917 yellow" aren't.

Jeffrey

----------------------------------------
> CC: r-help at r-project.org
> From: dwinsemius at comcast.net
> To: johjeffrey at hotmail.com
> Subject: Re: [R] Table rearranging
> Date: Tue, 7 Feb 2012 21:51:55 -0500
>
>
> On Feb 7, 2012, at 8:43 PM, Jeffrey Joh wrote:
>
> >
> > Hi David, I am not sure how ddply/summarize solves my issue. I have
> > the following table:
> >
> > ID measurement date door color
> > 1 0.93529385 513 open red
> > 2 0.97419293 420 open red
> > 3 0.962053514 513 closed red
> > 4 0.963909937 1230 open blue
> > 5 0.97652034 1230 open green
> > 6 0.989310795 1230 closed blue
> > 7 0.9941022 917 closed yellow
> > 8 0.8945757 1230 open blue
> >
> > I only want to keep the lines that have corresponding open/closed
> > measurements. For example, I want to keep lines 4,6,8 because for
> > the "1230 blue" condition, there exists both open and closed
> > measurements.
> >
> > However, the "513 red" condition has an open measurement, but no
> > closed measurement.
>
> Huh? what about line 3?
>
> --
> David,
> > Therefore, line 1 should be deleted.
> >
> > Jeffrey
> >
> >
> > ----------------------------------------
> >> CC: r-help at r-project.org; wdunlap at tibco.com
> >> From: dwinsemius at comcast.net
> >> To: johjeffrey at hotmail.com
> >> Subject: Re: [R] Table rearranging
> >> Date: Tue, 7 Feb 2012 09:08:00 -0500
> >>
> >>
> >> On Feb 7, 2012, at 4:21 AM, Jeffrey Joh wrote:
> >>
> >>>
> >>> Thank you for your help, Bill.
> >>>
> >>>> From the original table (not the plyr output), I would like to
> >>>> remove all the lines that do not have a corresponding open/closed
> >>>> measurement. For example, if there is a Closed yellow measurement
> >>>> on 0917, but not an Open yellow 0917 measurement, then the Closed
> >>>> yellow should be deleted.
> >>>
> >>> How can I make this change?
> >>>
> >>
> >> In R you need to assign the results of a function to an object name
> >> so
> >> you code would look like:
> >>
> >> modified_data <- ddply(d, .(date, color), summarize,
> >> meanClosed=mean(measurement[door=="closed"]),
> >> nClosed=sum(door=="closed"))
> >>
> >> --
> >> David
> >>> Jeffrey
> >>>
> >>>
> >>> ----------------------------------------
> >>>> From: wdunlap at tibco.com
> >>>
> >>>> To: johjeffrey at hotmail.com; r-help at r-project.org
> >>>
> >>>> Subject: RE: [R] Table rearranging
> >>>
> >>>> Date: Tue, 7 Feb 2012 00:43:25 +0000
> >>>
> >>>>
> >>>
> >>>> Install and load the "plyr" package and try something like:
> >>>
> >>>>
> >>>
> >>>>> ddply(d, .(date, color), summarize,
> >>>
> >>>> + ddply(d, .(date, color), summarize
> >>>
> >>>> + meanClosed=mean(measurement[door=="closed"]),
> >>>> nClosed=sum(door=="closed"))
> >>>
> >>>> date color meanOpen nOpen meanClosed nClosed
> >>>
> >>>> 1 420 red 0.9741929 1 NaN 0
> >>>
> >>>> 2 513 red 0.9352938 1 0.9620535 1
> >>>
> >>>> 3 917 yellow NaN 0 0.9941022 1
> >>>
> >>>> 4 1230 blue 0.9639099 1 0.9893108 1
> >>>
> >>>> 5 1230 green 0.9765203 1 NaN 0
> >>>
> >>>>
> >>>
> >>>> Bill Dunlap
> >>>
> >>>> Spotfire, TIBCO Software
> >>>
> >>>> wdunlap tibco.com
> >>>
> >>>>
> >>>
> >>>>> -----Original Message-----
> >>>
> >>>>> From: r-help-bounces at r-project.org [mailto:r-help-bounces at r-project.org
> >>>>> ] On Behalf Of Jeffrey Joh
> >>>
> >>>>> Sent: Monday, February 06, 2012 4:28 PM
> >>>
> >>>>> To: r-help at r-project.org
> >>>
> >>>>> Subject: [R] Table rearranging
> >>>
> >>>>>
> >>>
> >>>>>
> >>>
> >>>>> I have a table that looks like this:
> >>>
> >>>>>
> >>>
> >>>>> measurement date door color
> >>>
> >>>>> 0.93529385 513 open red
> >>>
> >>>>> 0.97419293 420 open red
> >>>
> >>>>> 0.962053514 513 closed red
> >>>
> >>>>> 0.963909937 1230 open blue
> >>>
> >>>>> 0.97652034 1230 open green
> >>>
> >>>>> 0.989310795 1230 closed blue
> >>>
> >>>>> 0.9941022 917 closed yellow
> >>>
> >>>>>
> >>>
> >>>>> I would like to create a table that has: Open measurement, Closed
> >>>>> measurement, date, color. For every
> >>>
> >>>>> date/color combination, there should be two columns to represent
> >>>>> the door open/closed measurement.
> >>>
> >>>>>
> >>>
> >>>>> If there are multiple datapoints with a given door/date/color
> >>>>> combination, then they should be
> >>>
> >>>>> averaged.
> >>>
> >>>>> I would also like to make two columns to represent the number of
> >>>
> >>>>> datapoints that were averaged in determining the open/closed
> >>>
> >>>>> measurements.
> >>>
> >>>>>
> >>>
> >>>>> Jeffrey
> >>>
> >>>>>
> >>>
> >>>>> ______________________________________________
> >>>
> >>>>> R-help at r-project.org mailing list
> >>>
> >>>>> https://stat.ethz.ch/mailman/listinfo/r-help
> >>>
> >>>>> PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
> >>>
> >>>>> and provide commented, minimal, self-contained, reproducible code.
> >>>
> >>>
> >>> ______________________________________________
> >>> R-help at r-project.org mailing list
> >>> https://stat.ethz.ch/mailman/listinfo/r-help
> >>> PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
> >>> and provide commented, minimal, self-contained, reproducible code.
> >>
> >> David Winsemius, MD
> >> West Hartford, CT
> >>
> >
>
> David Winsemius, MD
> West Hartford, CT
>
 		 	   		  


More information about the R-help mailing list