[R] algorithm for clustering categorical data

Li, Yan Yan_Li at ibi.com
Thu Aug 1 18:19:04 CEST 2013


Thanks for the reply....

>From the link you provided, only two packages mentioned categorical field: depmix and depmixS4. I'll look at them.



-----Original Message-----
From: David Winsemius [mailto:dwinsemius at comcast.net] 
Sent: Thursday, August 01, 2013 12:15 PM
To: Li, Yan
Cc: r-help at r-project.org
Subject: Re: [R] algorithm for clustering categorical data


On Aug 1, 2013, at 9:00 AM, Li, Yan wrote:

> Hi All,
> 
> Does anyone know what algorithm for clustering categorical variables? 
> R packages?

Many.

http://cran.r-project.org/web/views/Cluster.html

> Which is the best?

For what purpose?

> 
> If a data has both numeric and categorical data, what is the best 
> clustering algorithm to use and R package?
> 
> I tried numeric transformation of all categorical fields  and doing clustering afterwards. But the transformed fields have values from 1...10, and my other fields is in a bigger scale: 10000-...This will make the categorical fields has less effect on the distance calculation...
> 

This seems impossibly vague and confused. You are asked in the Posting Guide to provide a working example if you want help with code.

-- 

David.


David Winsemius
Alameda, CA, USA



More information about the R-help mailing list