[R] Data Step: Outer Join Rows, Inner Join Columns

Christian Schulz chschulz at email.de
Tue Sep 29 09:19:27 CEST 2009


Hi,

if you like to use sql you can use the sqldf package.
Another possibility  is  the  basic function merge.

HTH Christian


> Hi,
> 
> how can I do a data step in R which can be best described in SQL terminology as an outer join of rows (eg cases) and inner join of columns (variables)? In case of conflicting non-missing values the first dataset has higher always priority. Missing values should always be replaced by valid values (cf. coalesce function in SQL):
> 
> INPUT DATASET 1:
> 
>   A  B  C
> 1 11 NA 31
> 2 12 22 32
> 3 NA 23 33
> 
> INPUT DATASET 2:
> 
>   C  D  E
> 2 NA 42 52   
> 3 NA 43 53
> 4 34 44 54
> 
> OUTPUT DATASET:
>    A  B  C  D  E
> 1  11 NA 31 NA NA
> 2  12 22 32 42 52
> 3  NA 23 33 43 53
> 4  NA NA 34 44 54
> 
> Thanks in advance,
> 
> Will
> 
> ______________________________________________
> R-help at r-project.org mailing list
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
> and provide commented, minimal, self-contained, reproducible code.
>




More information about the R-help mailing list