[R] extract decision tree from rpart
timlee126 at yahoo.com
Thu May 14 16:09:50 CEST 2009
I am using rpart for my decision stump. I am trying to extract the learned
stump from the output of rpart.
cntrl <- rpart.control(maxdepth = 1, minsplit = learn-1,
maxsurrogate = 0, usesurrogate=0, maxcompete = 1,
cp = 0, xval = 0)
fit <- rpart(y~bx, weights = w/mean(w), control = cntrl)
After some while of searching rpart document, playing the code and asking
questions, I came to this tentative conclusion that:
if my stump is for classifiction, fit[]$var[] is the selected feature
ID, and fit$split[1,4] is the splitting value
if my stump is for regression, fit[]$var[] is the selected feature ID,
fit$split[1,4] is the splitting value, fit[]$yval[] and
fit[]$yval[] are the two constant values for the two leaves.
However I get stuck again. For example, for my classification stump, here is
a value of fit, where feature No. 59 is selected and split at 0.0065
node), split, n, deviance, yval
* denotes terminal node
1) root 236 236.0000 -2.831422e-17
2) bx.59< 0.0065 156 0.0000 -1.000000e+00 *
3) bx.59>=0.0065 80 116.7475 5.053073e-01 *
60 Levels: <leaf> bx.1 bx.2 bx.3 bx.4 bx.5 bx.6 bx.7 bx.8 bx.9 bx.10 ...
if I give fit[]$var[] to a variable rr, then it is wrong
> rr <- as.numeric(fit[]$var[])
Can someone tell me how to correctly get the decision tree info from the
output of rpart?
Thanks and regards!
View this message in context: http://www.nabble.com/extract-decision-tree-from-rpart-tp23541393p23541393.html
Sent from the R help mailing list archive at Nabble.com.
More information about the R-help