[R] diamonds data set from ggplot2

Ebert,Timothy Aaron tebert @end|ng |rom u||@edu
Wed Apr 26 00:03:30 CEST 2023

Thank you for sharing the paper.

-----Original Message-----
From: R-help <r-help-bounces using r-project.org> On Behalf Of Hadley Wickham
Sent: Tuesday, April 25, 2023 9:44 AM
To: Sigbert Klinke <sigbert using wiwi.hu-berlin.de>
Cc: r-help using r-project.org
Subject: Re: [R] diamonds data set from ggplot2


On Tue, Apr 25, 2023 at 4:09 AM Sigbert Klinke <sigbert using wiwi.hu-berlin.de> wrote:
> Hi,
> is there any information about the source of this data set?
> 1.) I read Marina Doucerain's 2012 question about the date and
> Hadley Wickham's answer, "I believe it was 2008."
> 2.) On Kaggle someone said, "It's a Tiffany & Co's snapshot pricelist
> from 2017."
> So, where does the data set come from? Which year?

Digging back into my old files, it looks like I created the data in Feb 2007 by scraping http://www.diamondse.info/ with a Ruby script.
I've attached the paper I wrote about it for an (unsuccessful) submission to the Journal of Statistics Education.


