[R] Regularized Discriminant Analysis scores, anyone?

Matthew Fagan mef2153 at columbia.edu
Sun Jun 2 17:57:41 CEST 2013


Thank you Dr. Ligges, i very much appreciate the quick reply.  i 
wondered if that was the case, based on the math as I (poorly) 
understood it.  However i remain confused.   page 107 from the "rrcov" 
package PDF makes me think I can derive LDA-style discriminant scores 
for a QDA:

library(rrcov)
data(iris)
qda1<-QdaClassic(x=iris[,1:4], grouping=iris[,5])
pred_qda<-predict(qda1, iris[,1:4])
head(pred_qda at x)
plotdat<-pred_qda at x
plot(plotdat[,1], plotdat[,2])
plot(plotdat[,2], plotdat[,3])

pred_qda$x looks like QDA discriminant scores.   No doubt you are right, 
but if you have a moment, I'd love to know what these scores are and 
what they summarize.

In addition, I have run into this nice set of lengthy R code to manually 
calculate discriminant scores for a QDA:
https://cs.uwaterloo.ca/~a2curtis/courses/2005/ML-classification.pdf

None of this means i can calculate discriminant scores for a RDA, of 
course, but QDA is my back-up choice.

Bottom line: am i am completely misinterpreting what I am seeing here, 
mathematically?  Or is this just the result of different ways of 
implementing QDA in R?

Regards, and thanks again,
Matt


On 6/2/2013 10:39 AM, Uwe Ligges wrote:
>
>
> On 02.06.2013 05:01, Matthew Fagan wrote:
>> Hi all,
>>
>> I am attempting to do Regularized Discriminant Analysis (RDA) on a large
>> dataset, and I want to extract the RDA  discriminant score matrix.  But
>> the predict function in the "klaR" package, unlike the predict function
>> for LDA in the "MASS" package, doesn't seem to give me an option to
>> extract the scores.  Any suggestions?
>
> There are no such scores:
>
> same as for qda, you do not follow the Fisher idea of the linear 
> discriminant components any more: Your space is now partitioned by 
> ellipsoid like structures based on the estimation of the inner-class 
> covariance matrices.
>
> rda as implemented in klaR (see the reference given on the help page) 
> is a regularization that helps to overcome problems when estimating 
> non-singular covariance matrices for the separate classes.
>
>
>> i have already tried (and failed; ran out of 16 GB of memory) to do this
>> with the "rda" package: don't know why, but the klaR package seems to be
>> much more efficient with memory.  I have included an example below:
>
> The rda package provides a completely different regularization 
> technique, see the reference given on the help page.
>
> Best,
> Uwe Ligges
>
>
>
>
>> library(klaR)
>> library(MASS)
>>
>> data(iris)
>>
>> x <- rda(Species ~ ., data = iris, gamma = 0.05, lambda = 0.2)
>> rda1<-predict(x, iris[, 1:4])
>> str(rda1)
>>
>> #  This gets you an object with posterior probabilities and classes, but
>> no discriminant scores!
>>
>> #  if you run lda
>>
>> y <- lda(Species ~ ., data = iris)
>> lda1<-predict(y, iris[, 1:4])
>> str(lda1)
>>
>> head(lda1$x)  #  gets you the discriminant scores for the LDA. But how
>> to do this for RDA?
>>
>> #  curiously, the QDA function in MASS has this same problem, although
>> you can get around it using the rrcov package.
>>
>> Regards, and thank very much for any help,
>> Matt
>>
>> ______________________________________________
>> R-help at r-project.org mailing list
>> https://stat.ethz.ch/mailman/listinfo/r-help
>> PLEASE do read the posting guide
>> http://www.R-project.org/posting-guide.html
>> and provide commented, minimal, self-contained, reproducible code.
>

-- 
Matthew Fagan
Columbia University
Department of Ecology, Evolution, and Environmental Biology
512-569-1417 (cell/home)
(212) 854-9987 (office)
(212) 854-8188 (fax)



More information about the R-help mailing list