[R] Combining data.frames

Bert Gunter bgunter@4567 @end|ng |rom gm@||@com
Sun Mar 20 04:44:42 CET 2022


Merging by the common keys/column names is the default. The question is
likely what to do with rows that don't match. That's determined by the
'all' settings (all, all.x, all.y), which the OP may already have figured out.
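
As a rough sketch of the point above, using the OP's test data from the quoted
message: the first call relies on merge()'s default of joining on the column
names the two data.frames share, and the second sets all.x = TRUE to keep the
unmatched rows of df1. The object names `inner` and `left` are only
illustrative, and whether all.x is what the OP actually needs is an assumption.

```r
# Test data copied from the original post
df1 <- data.frame(
  date     = c("2021-1-1","2021-1-1","2021-1-1","2021-1-1","2021-1-1",
               "2021-1-2","2021-1-2","2021-1-3","2021-1-3","2021-1-3"),
  geo_hash = c("abc123","abc123","abc456","abc789","abc246",
               "abc123","asd123","abc789","abc890","abc123"),
  ad_id    = c("a12345","b12345","a12345","a12345","c12345",
               "b12345","b12345","a12345","b12345","a12345"))
df2 <- data.frame(
  date     = c("2021-1-1","2021-1-1","2021-1-2","2021-1-3","2021-1-3"),
  geo_hash = c("abc123","abc456","abc123","abc789","abc890"),
  event    = c("shoting","ied","protest","riot","protest"))

# Default: by = intersect(names(df1), names(df2)), here date and geo_hash,
# and all = FALSE, so only rows with a match in both inputs are kept.
inner <- merge(df1, df2)

# all.x = TRUE keeps every row of df1; rows without a match in df2
# get NA in the 'event' column.
left <- merge(df1, df2, by = c("date", "geo_hash"), all.x = TRUE)

nrow(inner)  # 6 rows have a matching date/geo_hash pair
nrow(left)   # all 10 rows of df1 are retained
```

Note that merge() attaches the event to every df1 row that shares the key, so
both abc123 rows on 2021-1-1 receive the same event. Also, merge() sorts the
result on the key columns by default (sort = TRUE), so the row order can
differ from that of df1.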

On Sat, Mar 19, 2022, 7:16 PM Tom Woolman <twoolman using ontargettek.com> wrote:

> I'm trying hard to take tonight off and avoid booting up the laptop and
> launching R... :) but you need to merge by the primary key(s), i.e.
> the common columns (common IVs) shared between the two dataframes.
>
>
> On 2022-03-19 21:38, Jeff Reichman wrote:
> > Tom
> >
> > Looks like I figured it out. Syntax issue - wrong "all" argument  (I
> > think)
> >
> > -----Original Message-----
> > From: Tom Woolman <twoolman using ontargettek.com>
> > Sent: Saturday, March 19, 2022 8:27 PM
> > To: reichmanj using sbcglobal.net
> > Cc: r-help using r-project.org
> > Subject: Re: [R] Combining data.frames
> >
> > You can also do "SQL-like" joins in the tidyverse with dplyr.
> >
> >
> > On 2022-03-19 21:23, Jeff Reichman wrote:
> >> Evening Tom
> >>
> >> Yes, I've been playing with the merge function, but haven't been able
> >> to achieve what I need. It could be the way to go, and it might be my
> >> syntax.
> >>
> >> -----Original Message-----
> >> From: Tom Woolman <twoolman using ontargettek.com>
> >> Sent: Saturday, March 19, 2022 8:20 PM
> >> To: reichmanj using sbcglobal.net
> >> Cc: r-help using r-project.org
> >> Subject: Re: [R] Combining data.frames
> >>
> >> Have you looked at the merge function in base R?
> >>
> >> https://www.rdocumentation.org/packages/base/versions/3.6.2/topics/merge
> >>
> >>
> >> On 2022-03-19 21:15, Jeff Reichman wrote:
> >>> R-Help Community
> >>>
> >>> I'm trying to combine two data.frames, each containing 10 columns,
> >>> of which two fields are common to both. Here are two small test
> >>> datasets.
> >>>
> >>> df1 <- data.frame(
> >>>   date     = c("2021-1-1","2021-1-1","2021-1-1","2021-1-1","2021-1-1",
> >>>                "2021-1-2","2021-1-2","2021-1-3","2021-1-3","2021-1-3"),
> >>>   geo_hash = c("abc123","abc123","abc456","abc789","abc246",
> >>>                "abc123","asd123","abc789","abc890","abc123"),
> >>>   ad_id    = c("a12345","b12345","a12345","a12345","c12345",
> >>>                "b12345","b12345","a12345","b12345","a12345"))
> >>> df2 <- data.frame(
> >>>   date     = c("2021-1-1","2021-1-1","2021-1-2","2021-1-3","2021-1-3"),
> >>>   geo_hash = c("abc123","abc456","abc123","abc789","abc890"),
> >>>   event    = c("shoting","ied","protest","riot","protest"))
> >>>
> >>> I'm trying to combine them such that I get a combined data.frame
> >>> such as
> >>>
> >>> date        geo_hash    ad_id     event
> >>> 1/1/2021    abc123      a12345    shoting
> >>> 1/1/2021    abc123      b12345
> >>> 1/1/2021    abc456      a12345    ied
> >>> 1/1/2021    abc789      a12345
> >>> 1/1/2021    abc246      c12345
> >>>
> >>> Jeff
> >>>
> >>> ______________________________________________
> >>> R-help using r-project.org mailing list -- To UNSUBSCRIBE and more, see
> >>> https://stat.ethz.ch/mailman/listinfo/r-help
> >>> PLEASE do read the posting guide
> >>> http://www.R-project.org/posting-guide.html
> >>> and provide commented, minimal, self-contained, reproducible code.
>
