[R] multinomial: stepwise selection using prediction error

rdiaz rdiaz at inner.es
Fri Oct 19 15:22:52 CEST 2001


Dear All,

I often come across observational data sets, where the interest is in predicting
the class membership (often more than 2 classes) as a function of several
variables; generally, the number of predictors is very large, and it is also of
interest to make that number as small as possible (for instance, to minimize length
of future questionaires). I thought that a possible approach would be to use some
kind of stepwise model selection; as criterion for variable selection I would use
the prediction error from models fitted with "multinom" (package nnet), where the
prediction error would be obtained using k-fold cross-validation.

I have seen somewhat similar approaches, but not this one in particular, and since
I'd think the general situation is fairly common to many people,  I am wondering
whether the idea makes sense, or if it is a completely misguided and boneheaded
one.

(I think this is relatively easy to implement, comparing the results of
predict.multinom with the true class membership of the hold-out sets; that would be
the value returned by the function "extractAIC.mycvmultinom", and then I would be
able to just call stepAIC on objects of class "mycvmultinom").

Thanks,

Ramón Díaz
Inner Research
Velázquez 109
28006 Madrid
Spain


-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-
r-help mailing list -- Read http://www.ci.tuwien.ac.at/~hornik/R/R-FAQ.html
Send "info", "help", or "[un]subscribe"
(in the "body", not the subject !)  To: r-help-request at stat.math.ethz.ch
_._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._



More information about the R-help mailing list