New package: colSums

David Brahm brahm at alum.mit.edu
Mon Jan 7 18:16:08 CET 2002


I've uploaded a package colSums_1.0.tar.gz to CRAN /src/contrib/Devel.  It
contains functions colSums, colMeans, colVars, colStdevs, rowSums, rowMeans,
rowVars, and rowStdevs.  These do simple, fast arithmetic on columns/rows of a
matrix, or more generally across dimensions of an array, e.g. colSums(m) =
apply(m, 2, sum) but faster.  They should be compatible with the corresponding
S-Plus functions.

The core code was written by Doug Bates <bates at stat.wisc.edu>, as package
"MatUtils", and posted to R-help on July 19, 2001.  Many thanks to Doug, Peter
Dalgaard <p.dalgaard at biostat.ku.dk>, Thomas Lumley <tlumley at u.washington.edu>,
and Prof Brian Ripley <ripley at stats.ox.ac.uk> for their assistance; see the
R-help thread "colSums in C".

- Brian Ripley has taken up the torch to improve these functions further and
  put them in R-devel, hopefully (I think) for R-1.5.0.  That's why my package
  is in /src/contrib/Devel; it should be obsolete by the next release of R!  I
  expect complete argument compatibility, though, since we have both tried to
  be compatible with S-Plus.

- I detect NA's with a trick that Thomas Lumley suggested (it's not NA or NaN
  if x==x), which works really great for me, but may not be portable.  If it
  doesn't work for you, try modifying the #define line in colSums.c.  I would
  like to hear about platforms where this fails.

- colVars is very naive, e.g. I'm probably exacerbating roundoff error when
  mu >> sigma.  I personally don't worry because in finance, mu (return) is
  never >> sigma (risk) :-).  The S-Plus documentation for colVars claims they
  do something fancy with the "two-pass method described in Chan, Golub, and
  LeVeque (1983)" that I don't know anything about.

- I convert integers, logicals, and complex immediately to reals, which is
  probably inefficient.  Brian Ripley's version seems to do a better job.

- S-Plus does not have "rowStdevs" for reasons unknown, since it is simply
  defined as rowStdevs(x, ...) <- function(x, ...) sqrt(rowVars(x, ...)).

- For me, colSums is about 23x faster than apply (on a 400 x 40000 matrix).

-- 
                              -- David Brahm (brahm at alum.mit.edu)
-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-
r-announce mailing list -- Read http://www.ci.tuwien.ac.at/~hornik/R/R-FAQ.html
Send "info", "help", or "[un]subscribe"
(in the "body", not the subject !)  To: r-announce-request at stat.math.ethz.ch
_._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._



More information about the R-announce mailing list