[R] Data visualization: overlay columns of train/test/validation datasets

David Winsemius dwinsemius at comcast.net
Wed Jul 2 01:42:57 CEST 2014

On Jul 1, 2014, at 3:46 PM, Supriya Jain wrote:

> Hello,
> Given two different datasets (having the same number and type of columns,
> but different observations, as commonly encountered in data-mining as
> train/test/validation datasets), is it possible to overlay plots
> (histograms) and compare the different attributes from the separate
> datasets, in order to check how similar the different datasets are?
> Is there a package available for such plotting together of similar columns
> from different datasets?

Possible. Assuming you just want frequency histograms (or ones using counts for that matter) it can be done in any of the three major plotting paradigms supported in R. No extra packages needed if using just base graphics.

> Thanks,
> SJ
> 	[[alternative HTML version deleted]]

Oh, you must have missed the parts of the Posign Guide where plain text was requyested. See below.

> PLEASE do read the posting guide http://www.R-project.org/posting-guide.html

And you missed that section, as well.

> and provide commented, minimal, self-contained, reproducible code.

David Winsemius
Alameda, CA, USA

More information about the R-help mailing list