AW: [R] non-hierarchical non-exclusive clustering of large data sets

TEMPL Matthias Matthias.Templ at statistik.gv.at
Mon May 24 16:39:33 CEST 2004


I think the "cmeans" method in library(e1071) works better for large data sets than "fanny".

(Note, not specific to this data: fanny also has problems with standardized large data sets. There, fanny produces the same memberships for all observations, whereas cmeans works "correctly".)
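
For illustration, a minimal sketch of the cmeans call mentioned above. It assumes the e1071 package is installed; the toy data, the choice of 2 clusters, and the 0.3 membership cutoff are made up for the example and are not taken from the original question.

```r
# Assumes the e1071 package is installed: install.packages("e1071")
library(e1071)

set.seed(1)  # cmeans starts from randomly chosen centers

# Hypothetical toy data: 4000 observations, 3 variables, two shifted groups
x <- rbind(matrix(rnorm(2000 * 3), ncol = 3),
           matrix(rnorm(2000 * 3, mean = 4), ncol = 3))

# Fuzzy c-means with 2 clusters; m = 2 is the default fuzziness exponent
fit <- cmeans(x, centers = 2, iter.max = 100, m = 2)

# One row per observation, one column per cluster; each row sums to 1
head(fit$membership)

# Non-exclusive assignment: an observation belongs to every cluster whose
# membership exceeds a cutoff (0.3 is an arbitrary illustrative value)
in_cluster <- fit$membership > 0.3
table(rowSums(in_cluster))
```

The membership matrix is what makes the clustering non-exclusive: instead of taking only the largest entry per row, any cutoff rule can assign an observation to several clusters at once.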

Matthias


> -----Original Message-----
> From: Bhaskar S. Manda [mailto:bhaskar at cacmnet.com] 
> Sent: Monday, 24 May 2004 17:12
> To: r-help at stat.math.ethz.ch
> Subject: Re: [R] non-hierarchical non-exclusive clustering of 
> large data sets
> 
> 
> On Mon, 24 May 2004 15:58:57 +0200, Murk Wuite wrote: 
> > I'm trying to use R to cluster words with related meanings. Does 
> > anyone know of a non-hierarchical clustering method in R that produces 
> > non-exclusive clusters? With non-exclusive, I mean that words should
> 
> The "fanny" method in library(cluster) outputs probabilities 
> of membership in each cluster.
> 
> > the one above, its dimensions would be in the order of (100000, 
> > 100000). Does anyone know if this would cause practical problems,  
> > perhaps very slow clustering?
> 
> I had a much smaller matrix, 4000x3; fanny took about 4 minutes 
> wall clock time on a lightly loaded (there were many other 
> processes, but none computational) 1.4 GHz Athlon. It was 
> completely CPU-bound. 
> 
> --
> bhaskar
> 



