[R] Question about rpart and regression trees

Paul Smith phhs80 at gmail.com
Tue Jan 23 12:13:28 CET 2007


On 1/23/07, Prof Brian Ripley <ripley at stats.ox.ac.uk> wrote:
> > I would like to use rpart to obtain a regression tree for a dataset
> > like the following:
> >
> > Y     X1      X2      X3      X4
> > 5.500033      B       A       3       2
> > 0.35625148    D       B       6       5
> > 0.8062546     E       C       4       3
> > 5.100014      C       A       3       2
> > 5.7000422     A       A       3       2
> > 0.76875436    C       A       6       5
> > 1.0312537     D       A       4       1
> >
> > Y is the objective variable. X1, X2, X3 and X4 can take, respectively,
> > the following values:
> >
> > X1: A,B,C,D,E
> > X2: A,B,C,D,E
> > X3: 3,4,5,6
> > X4. 1,2,3,4,5
> >
> > Should I convert X3 and X4 to factor before running rpart?
>
> If they really are factors, yes.
> If they are ordered factors, no.

Thanks, Prof. Ripley. Is it correct to adopt the same procedure in
case of classification trees, i.e., in case the objective variable (Y)
is categorical and X1, X2, X3 and X4 are as above?

Paul



More information about the R-help mailing list