[R] modeling large data

Benilton Carvalho bcarvalh at jhsph.edu
Sat Oct 20 18:43:47 CEST 2007


And it's likely you want to combine biglm with a database approach,  
either by using your own dbs (RSQLite or equivalent) or using tools  
like the 'sqlitedf' pkg.

B

On Oct 20, 2007, at 12:26 PM, Benilton Carvalho <bcarvalh at jhsph.edu>  
wrote:

> And I suggested to use the 'biglm' package. Didn't it work? It has
> examples that I found useful.
>
> B
>
> On Oct 20, 2007, at 11:41 AM, "Wensui Liu" <liuwensui at gmail.com>  
> wrote:
>
>> Hi, Dear Listers,
>> Several days ago, I posted a question regarding modeling large  
>> dataset
>> in R. Could anyone with such experience shed some light on it?
>> I truly appreciate it.
>>
>> wensui
>>
>> On 10/17/07, Wensui Liu <liuwensui at gmail.com> wrote:
>>> Hi, Dear Listers,
>>> I am just curious if R is able to model a logistic regression with
>>> 2-3000 variables and 30-40 million records.
>>> I know R is good but just want to know how good is to use it in a
>>> business envirnment such as database marketing.
>>> Thank you so much for you insight!
>>>
>>>
>>> --
>>> ===============================
>>> WenSui Liu
>>> Statistical Project Manager
>>> ChoicePoint Precision Marketing
>>> (http://spaces.msn.com/statcompute/blog)
>>> ===============================
>>>
>>
>>
>> -- 
>> ===============================
>> WenSui Liu
>> Statistical Project Manager
>> ChoicePoint Precision Marketing
>> (http://spaces.msn.com/statcompute/blog)
>>
>> ______________________________________________
>> R-help at r-project.org mailing list
>> https://stat.ethz.ch/mailman/listinfo/r-help
>> PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
>> and provide commented, minimal, self-contained, reproducible code.
>
> ______________________________________________
> R-help at r-project.org mailing list
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
> and provide commented, minimal, self-contained, reproducible code.



More information about the R-help mailing list