[R] PCA and categorical data

(Ted Harding) Ted.Harding at manchester.ac.uk
Fri Mar 6 10:46:17 CET 2009


On 06-Mar-09 09:25:26, Prof Brian Ripley wrote:
> You might want to look into correspondence analysis, which has several 
> variants of PCA designed for categorical data.

In particular, have a look at the results of

  RSiteSearch("correspondence")

Ted.

> On Fri, 6 Mar 2009, Galanidis Alexandros wrote:
> 
>> Hi all,
>>
>> I' m trying to figure out if it is appropriate to do a PCA having only
>> categorical data (not ordinal). I have only find the following quote:
>>
>> One method to find such relationships is to select appropriate
>> variables and
>> to view the data using a method like Principle Components Analysis
>> (PCA) [4].
>> This approach gives us a clear picture of the data using KL-plot of
>> the PCA.
>> However, the method is not settled for the data including categorical
>> data.
>> [http://hp.vector.co.jp/authors/VA038807/personal/covEigGiniRep17.pdf]
>>
>> but I'm still not sure if it WRONG to do so.
> 
> Since normally categorical data is taken to be binomial or Poisson 
> distributed, the variance varies with the mean and least-squares (the 
> basis of PCA) is then sub-optimal.  Correspondence analysis takes that 
> into account (at least to some extent).
> 
>> Any opinion or reference would be very helpful
> 
> There is a basic introduction in MASS4, with references to more 
> comprehensive accounts.
> 
>> thanks
>>
>> ______________________________________________
>> R-help at r-project.org mailing list
>> https://stat.ethz.ch/mailman/listinfo/r-help
>> PLEASE do read the posting guide
>> http://www.R-project.org/posting-guide.html
>> and provide commented, minimal, self-contained, reproducible code.
>>
> 
> -- 
> Brian D. Ripley,                  ripley at stats.ox.ac.uk
> Professor of Applied Statistics,  http://www.stats.ox.ac.uk/~ripley/
> University of Oxford,             Tel:  +44 1865 272861 (self)
> 1 South Parks Road,                     +44 1865 272866 (PA)
> Oxford OX1 3TG, UK                Fax:  +44 1865 272595
> 
> ______________________________________________
> R-help at r-project.org mailing list
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide
> http://www.R-project.org/posting-guide.html
> and provide commented, minimal, self-contained, reproducible code.

--------------------------------------------------------------------
E-Mail: (Ted Harding) <Ted.Harding at manchester.ac.uk>
Fax-to-email: +44 (0)870 094 0861
Date: 06-Mar-09                                       Time: 09:46:15
------------------------------ XFMail ------------------------------




More information about the R-help mailing list