[R] Efficient Package for Huge datasets in R

Shylashree U.R shylashivashree at gmail.com
Thu Sep 28 14:00:31 CEST 2017


Dear Sir/Madam,

I have a large data set of 1,017,289 observations of 10,830 variables. I
need to use PCA to reduce the dimensionality of the dataset. I have already
tried prcomp and the irlba and nsprcomp packages in R, but none of them
could handle a data set of this size.

For example:

pc <- prcomp_irlba(sparseYY[1:5000, ], n = 50, retx = TRUE, center = TRUE,
                   scale. = FALSE)

With this I am able to get only a few PCs, and only for the first 5000 rows.

So could you please suggest which package I can use to do PCA in R on such
a large dataset?


Thanks and Regards



