[R] diamonds data set from ggplot2

Hadley Wickham h@w|ckh@m @end|ng |rom gm@||@com
Tue Apr 25 15:43:36 CEST 2023


On Tue, Apr 25, 2023 at 4:09 AM Sigbert Klinke
<sigbert using wiwi.hu-berlin.de> wrote:
>
> Hi,
>
> is there any information about the source of this data set?
>
> 1.) I read the question of Marina Doucerain in 2012 about the time and
> Hadley Wickhams answer "I believe it was 2008."
>
> 2.) In kaggle someone said "It's a Tiffany & Co's snapshot pricelist
> from 2017."
>
> So, where the data set stems from? Which year?

Digging back into my old files, it looks like I created the data in
Feb 2007 by scraping http://www.diamondse.info with a ruby script.
I've attached the paper I wrote about for an (unsuccessful) submission
to the Journal of Statistical Education.

Hadley

-- 
http://hadley.nz

-------------- next part --------------
A non-text attachment was scrubbed...
Name: diamonds.pdf
Type: application/pdf
Size: 1088975 bytes
Desc: not available
URL: <https://stat.ethz.ch/pipermail/r-help/attachments/20230425/fa9e8c63/attachment-0001.pdf>


More information about the R-help mailing list