[R] How to read HUGE data sets?

Gabor Grothendieck ggrothendieck at gmail.com
Fri Feb 29 14:39:51 CET 2008


On Fri, Feb 29, 2008 at 8:27 AM, Liviu Andronic <landronimirc at gmail.com> wrote:
> On 2/28/08, Gabor Grothendieck <ggrothendieck at gmail.com> wrote:
> >  The sqldf package can read a subset of rows and columns (actually any
> >  sql operation)
> >  from a file larger than R can otherwise handle.  It will automatically
> >  set up a temporary
> >  SQLite database for you, load the file into the database without going
> >  through R and
> >  extract just the data you want into R and then automatically delete
> >  the database.  All this
> >  can be done in 2 lines of code.
>
> Is it realistic to use this approach for datasets as big as 30-40 GB?

The SQLite site says SQLite is appropriate up to a few dozen gigabytes.
http://www.sqlite.org/whentouse.html

The only way to really know is to try it with your data.  Since it does not
involve much code it shouldn't take long to prepare a test.



More information about the R-help mailing list