[R] Cluster analysis using numeric and factor variables

Moshe Olshansky m_olshansky at yahoo.com
Wed Jun 11 04:13:13 CEST 2008


If you can define a distance between two vectors (where each one has some numerical and some categorical coordinates) then you can proceed with any clustering algorithm.

One possibility to get such a distance is to use RandomForest which can produce a proximity matrix which can be turned into distance matrix.

Regards,

Moshe.

--- On Wed, 11/6/08, Nagu <thogiti at gmail.com> wrote:

> From: Nagu <thogiti at gmail.com>
> Subject: [R] Cluster analysis using numeric and factor variables
> To: r-help at r-project.org
> Received: Wednesday, 11 June, 2008, 11:49 AM
> Hi,
> 
> Are there any algorithms that handle numeric and factor
> variables
> together in a cluster analysis?
> 
> Thank you,
> Nagu
> 
> ______________________________________________
> R-help at r-project.org mailing list
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide
> http://www.R-project.org/posting-guide.html
> and provide commented, minimal, self-contained,
> reproducible code.



More information about the R-help mailing list