[Rd] A couple of issues with colClasses/setAs
Prof Brian Ripley
ripley at stats.ox.ac.uk
Wed Sep 8 18:49:52 CEST 2004
On 8 Sep 2004, Peter Dalgaard wrote:
> Peter Dalgaard <p.dalgaard at biostat.ku.dk> writes:
> > To make it actually work, we should probably fix up the "what" that is
> > being passed to scan a bit further upstream.
> > > Might be a good idea to teach colClasses about "factor".
> > That's what I thought. Other ideas would be to predefine some standard
> > date classes (it's a bit annoying that there's no way to give
> > auxiliary information like formats), and maybe to allow a second
> > header line containing class names.
> Done already, I see (except the class-header idea, which probably
> requires more thought). Thanks.
I've done some. If you allow columns to be dropped it gets increasingly
hard to work out what as.is, for example, refers to, so I am looking at
allowing named vectors.
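
For illustration, a minimal sketch of how a named vector ties a class to a column by name rather than by position. It assumes the behaviour documented for read.table in current R releases (names in colClasses matched against the column names, and "NULL" to skip a column), which may differ from what was under consideration when this message was written. The sample data and column names are made up.

```r
# Hypothetical sample data; the column names are invented for illustration.
txt <- "id,when,score,junk
a1,2004-09-08,1.5,x
a2,2004-09-09,2.5,y"

# Named colClasses: each entry refers to a column by name, so dropping
# "junk" (class "NULL") does not change what the other entries refer to.
# Columns not named here ("score") are left to the default conversion.
df <- read.csv(text = txt,
               colClasses = c(junk = "NULL", id = "character", when = "Date"))

str(df)
```

With a positional vector, dropping a column would leave it unclear whether later indices count the dropped column or not; matching by name avoids that question.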
> While we're in there should we perhaps consider adding pseudo-classes
> etc? Or perhaps it is better with a catchall "Date-fmt", i.e., e.g.,
> "Date-%Y%m%d"? That'd be fairly easy to code:
>     else if (length(grep("^Date-", colClasses[i])))
>         as.Date(data[[i]], sub("^Date-(.*)$", "\\1", colClasses[i]))
> although perhaps not so easy to document concisely.
Is it really so hard to convert such columns from character afterwards?
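
As a rough sketch of what converting afterwards might look like: read the column as character, then apply as.Date with an explicit format. The data and column names below are hypothetical, and the text= argument of read.csv is assumed to be available (it is in current R releases).

```r
# Hypothetical data with dates stored as YYYYMMDD strings.
txt <- "id,day
1,20040908
2,20040909"

# Read the date column as character so no automatic conversion is applied.
df <- read.csv(text = txt, colClasses = c("integer", "character"))

# Convert afterwards, supplying the format explicitly.
df$day <- as.Date(df$day, format = "%Y%m%d")

str(df)
```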
Brian D. Ripley, ripley at stats.ox.ac.uk
Professor of Applied Statistics, http://www.stats.ox.ac.uk/~ripley/
University of Oxford, Tel: +44 1865 272861 (self)
1 South Parks Road, +44 1865 272866 (PA)
Oxford OX1 3TG, UK Fax: +44 1865 272595