[R] merging/intersecting 2 data frames

Erin Hodgess erinm.hodgess at gmail.com
Tue Jun 29 21:21:45 CEST 2010


Dear R People:

I have two data frames, a.df and b.df as seen here:

> a.df[1:10,]
        DATE GENDER PATIENT_ID AGE             SYNDROME
1  4/16/2009      F      23686  45         RASH ON BODY
2  4/16/2009      F      13840  35         CANT URINATE
3  4/16/2009      M      12895  30       BLURRED VISION
4  4/16/2009      M      18375  33       UNABLE TO VOID
5  4/16/2009      M       2237  44         SOB WEAKNESS
6  4/16/2009      F      21484  41 TOOTH PAINTOOTH PAIN
7  4/16/2009      M      10783  37          RT ARM PAIN
8  4/16/2009      M      12610  65        L FOOT INJURY
9  4/16/2009      F       3495  29 URINARY DIFFICULTIES
10 4/16/2009      F        351  36           PT STS MVA
> b.df[1:10,]
   DATE_OF_DEATH    ID
1      4/19/2009 21676
2      4/19/2009 13717
3      4/19/2009 20498
4      4/19/2009 14281
5      4/19/2009 38848
6      4/20/2009   331
7      4/20/2009  4084
8      4/20/2009 19616
9      4/20/2009 17965
10     4/20/2009 11863
>

a.df will always be larger than b.df.

I want to create a third data frame that is matched on PATIENT_ID from
a.df and ID from b.df.

If there is no match from a.df$PATIENT_ID to b.df$ID, then we omit the
row from the new data.frame.

If there is a match, we include the DATE_OF_DEATH column from b.df.

I've tried all kinds of tricks, but nothing works exactly as I wish.

Thanks in advance,
Sincerely,
Erin


-- 
Erin Hodgess
Associate Professor
Department of Computer and Mathematical Sciences
University of Houston - Downtown
mailto: erinm.hodgess at gmail.com



More information about the R-help mailing list