[R] Comparison of SAS & R/Splus

Marc Schwartz MSchwartz at medanalytics.com
Thu Sep 4 22:12:55 CEST 2003


On Thu, 2003-09-04 at 08:34, Frank E Harrell Jr wrote:
> On Thu, 04 Sep 2003 14:50:25 -0400
> "Paul, David  A" <paulda at BATTELLE.ORG> wrote:
> 
> > I am one of only 5 or 6 people in my organization making the
> > effort to include R/Splus as an analysis tool in everyday work -
> > the rest of my colleagues use SAS exclusively.
> > 
> > Today, one of them made the assertion that he believes the
> > numerical algorithms in SAS are superior to those in Splus
> > and R -- ie, optimization routines are faster in SAS, the SAS
> > Institute has teams of excellent numerical analysts that
> > ensure its superiority to anything freely available, PROC 
> > NLMIXED is more flexible than nlme( ) in the sense that it 
> > allows a much wider array of error structures than can be used 
> > in R/Splus, &etc.  
> > 
> > I obviously do not subscribe to these views and would like 
> > to refute them, but I am not a numerical analyst and am still 
> > a novice at R/Splus.  Do there exist refereed papers comparing the 
> > numerical capabilities of these platforms?  If not, are there 
> > other resources I might look up and pass along to my colleagues?
> > 
> > 
> > 
> > Much thanks in advance,
> >  
> >  david paul
> 
> I don't have papers comparing the numerical capabilities but I say
> bunk to your colleagues.  The last time I looked, SAS still relies on
> the out of date Gauss-Jordan sweep operator in many key places, in
> place of the QR decomposition that R and S-Plus use in regression. 
> And SAS being closed source makes it impossible to see how it really
> does calculations in some cases.
> 
> See http://hesweb1.med.virginia.edu/biostat/s/doc/splus.pdf Section
> 1.6 for a comparison of S and SAS (though this doesn't address
> numerical reliability).  Overall, SAS is about 11 years behind R and
> S-Plus in statistical capabilities (last year it was about 10 years
> behind) in my estimation.
> 
> Frank Harrell
> SAS User, 1969-1991


In follow up to Frank's reply, allow me to point you to some additional
papers and articles on numerical accuracy issues. I have not reviewed
these in some time and they may be a bit dated relative to current
versions. These do not cover R specifically, but do address S-Plus and
SAS. This is not an exhaustive list by any means, but many of the papers
do have other references that may be of value.


1. http://www.stat.uni-muenchen.de/~knuesel/elv/accuracy.html

2. http://www.amstat.org/publications/tas/mccull-1.pdf

3. http://www.amstat.org/publications/tas/mccull.pdf

4. http://www.npl.co.uk/ssfm/download/documents/cmsc06_00.pdf


Another option is that NIST has reference datasets available for comparison at:

http://www.itl.nist.gov/div898/strd/

These would allow you to conduct your own comparisons if you desire.

HTH,

Marc Schwartz
(Also a former SAS user)




More information about the R-help mailing list