[R] Looking for package for data generation for classification and regression

Tom Woolman twoo|m@n @end|ng |rom ont@rgettek@com
Thu Mar 3 22:04:15 CET 2022


Hi Paul. Have you considered just going onto Kaggle and GitHub and 
searching for some of the many freely available real datasets that are 
posted there? I'm seeing a lot of productivity there days with research 
focused on data generation, and not just on creating algorithms and 
predictive models. Which is a good thing for us ;)

One of the current research papers I'm working on now is based on mining 
a dataset I discovered on Kaggle a few months back and trying to create 
a novel solution for that. Proper credit will of course be provided in 
the citation references for the data provider.


Thanks,
Tom


On 2022-03-03 16:00, Paul Smith wrote:
> Dear All,
> 
> I am in need of generating artificial data for machine learning
> classification and regression analysis. What I am looking for is
> something similar to Python sklearn.datasets.make_classification and
> sklearn.datasets.make_regression:
> 
> https://scikit-learn.org/stable/modules/generated/sklearn.datasets.make_classification.html
> 
> https://scikit-learn.org/stable/modules/generated/sklearn.datasets.make_regression.html
> 
> I have searched CRAN for something similar, but found nothing. Could
> someone please help me with this?
> 
> Thanks in advance,
> 
> Paul
> 
> ______________________________________________
> R-help using r-project.org mailing list -- To UNSUBSCRIBE and more, see
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide 
> http://www.R-project.org/posting-guide.html
> and provide commented, minimal, self-contained, reproducible code.



More information about the R-help mailing list