[R] vsize and nsize

Tony Long tdlong at uci.edu
Tue May 18 17:41:07 CEST 1999


Jim:

	So now I am confused.  I have two vectors, var and pow.  var has a
dozen or so "levels".  (I cannot apply factor() to var either.)  Here is a
sample of my R session:

length(var)
94429
length(pow)
94429
tapply(pow,var,mean)
Error: heap memory (1953 Kb) exhausted [needed 737 more]....
var2 <- var[1:100]
pow2 <- pow[1:100]
tapply(pow2,var2,mean)
0	0.06
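
	In the meantime, here is a lower-memory workaround I am considering
(a sketch only; I have not run it on the full 94429-element vectors):
loop over the distinct values of var and average pow within each group,
instead of letting tapply build its intermediate structures:

grps <- unique(var)
m <- double(length(grps))            # one mean per level of var
names(m) <- as.character(grps)
for (i in seq(along = grps))         # subset pow one level at a time
    m[i] <- mean(pow[var == grps[i]])
m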

As I stated in previous messages, the version of R I have (0.63.2) cannot
increase its heap memory.  Brian Ripley suggested I upgrade to a newer
version (0.64.1, which I haven't done yet), so I cannot tell you whether
that solves the problem.
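
	For the archives: my understanding is that 0.64.x lets you set the
heap sizes from the command line with the --vsize and --nsize options
named in the subject, e.g. something like

R --vsize=20M --nsize=500000

(the sizes are guesses for my data, not recommendations; corrections
welcome).  From within R, gc() reports how much of each heap is
currently in use.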

>>
>> Tony Long <tdlong at uci.edu> wrote:
>>
>> > 	I agree that R is not "designed" for large calculations.  On the
>> > other hand it is nice to have one statistical package to use for all
>> > calculations.  I mostly deal with Drosophila and DNA; as such, I am an
>> > amateur statistician and would like to avoid learning a number of
>> > statistical languages.  With a big Linux box, I can often power through
>> > things.  In the past I have found it frustrating to do a bunch of stuff in
>> > SAS only to hit a snag and then have to write (time consuming) "C" code to
>> > finish the job.  So although not designed for large calculations, R is so
>> > flexible and logical that it is very attractive to use it for such...I
>> > think that many other people may be similarly attracted to the
>> > language.  I
>> > would appreciate dialog, as I think that much more may have been
>> > accomplished in R than was intended by the founders.
>>
>> R's poor handling of large datasets is half the reason I have
>> not moved more of my work from S(plus) to R (the other half
>> being the absence of trellis).  I love its lexical closures, but
>> they're not worth the memory penalty if you have huge datasets.
>
>I am wondering what you mean by "R's poor handling of large datasets".
>How large is large? I have often been working simultaneously with a
>fair number of vectors of say 40,000 using my libraries (data objects
>and functions) with no problems. They use the R scoping rules. On the
>other hand, if you use dataframes and/or standard functions like glm,
>then you are restricted to extremely small (toy) data sets. But then
>maybe you are thinking of gigabytes of data.


Tony Long
Ecology and Evolutionary Biology
Steinhaus Hall
University of California at Irvine
Irvine, CA
92697-2525

Tel:  (949) 824-2562   (office)          ****NOTE NEW AREA CODE****
Tel:  (949) 824-5994   (lab)              ****NOTE NEW AREA CODE****
Fax: (949) 824-2181                        ****NOTE NEW AREA CODE****

email:  tdlong at uci.edu 




