[R] set difference between two data frames

Simon Zehnder szehnder at uni-bonn.de
Thu Oct 31 21:51:40 CET 2013


You could e.g. take the data.table package (every data.table is a data.frame) and make a join:

dt.x <- data.table(x)
dt.y <- data.table(y)
merge.xy <- x[y, nomatch = 0]
diff.xy <- x[!merge.xy]



On 31 Oct 2013, at 21:41, Yasin Gocgun <entropy053 at gmail.com> wrote:

> Thanks. Actually, I forgot to add that both have the same number of columns.
> 
> On Thu, Oct 31, 2013 at 4:07 PM, Bert Gunter <gunter.berton at gene.com> wrote:
>> lapply() setdiff() by columns.
>> 
>> Unless you have failed to tell us something, you almost certainly will
>> not get a data frame (same number of rows/column) as your answer.
>> 
>> -- Bert
>> 
>> On Thu, Oct 31, 2013 at 12:58 PM, Yasin Gocgun <entropy053 at gmail.com> wrote:
>>> Hi,
>>> 
>>> I have two data frames, say, x and y, where y is a subset of x. How
>>> can I find the set difference of these two data frames (i.e., x-y)?
>>> 
>>> Thanks,
>>> 
>>> --
>>> Yasin Gocgun
>>> 
>>> ______________________________________________
>>> R-help at r-project.org mailing list
>>> https://stat.ethz.ch/mailman/listinfo/r-help
>>> PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
>>> and provide commented, minimal, self-contained, reproducible code.
>> 
>> 
>> 
>> --
>> 
>> Bert Gunter
>> Genentech Nonclinical Biostatistics
>> 
>> (650) 467-7374
> 
> 
> 
> -- 
> Yasin Gocgun
> 
> ______________________________________________
> R-help at r-project.org mailing list
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
> and provide commented, minimal, self-contained, reproducible code.



More information about the R-help mailing list