Re: Judging the quality of clustering

Jeff Eastman Wed, 16 May 2012 07:32:35 -0700

Mahout has a ClusterEvaluator and a CDbwEvaluator that compute some quality metrics (inter-cluster distance, intra-cluster distance, ...) that you may find useful. Both calculate a set of representative points from the clustering output and compute the (n^2) metrics over these points rather than all of the points in each cluster.
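
To make the idea concrete, here is a minimal Python sketch of pairwise metrics computed over a few representative points per cluster instead of over every point. It is not Mahout's code or API: the function names, the farthest-point selection rule, the Euclidean distance, and the use of plain averages are all assumptions chosen for illustration.

```python
import math
from itertools import combinations


def euclidean(a, b):
    """Euclidean distance between two equal-length coordinate tuples."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def centroid(points):
    """Component-wise mean of a non-empty list of points."""
    n = len(points)
    return tuple(sum(coords) / n for coords in zip(*points))


def representative_points(points, k):
    """Pick up to k representatives (assumed selection rule, not Mahout's):
    start with the point closest to the centroid, then repeatedly add the
    point farthest from the representatives chosen so far."""
    center = centroid(points)
    reps = [min(points, key=lambda p: euclidean(p, center))]
    while len(reps) < min(k, len(points)):
        candidates = [p for p in points if p not in reps]
        if not candidates:  # only duplicates of chosen points remain
            break
        reps.append(max(candidates,
                        key=lambda p: min(euclidean(p, r) for r in reps)))
    return reps


def mean_pairwise_distance(points):
    """Average distance over all unordered pairs: the O(n^2) part."""
    pairs = list(combinations(points, 2))
    if not pairs:
        return 0.0
    return sum(euclidean(a, b) for a, b in pairs) / len(pairs)


def evaluate(clusters, k=3):
    """Return (inter_cluster, intra_cluster) distances.

    inter_cluster: mean pairwise distance between cluster centroids.
    intra_cluster: mean, over clusters, of the mean pairwise distance
                   among each cluster's representative points.
    """
    centers = [centroid(pts) for pts in clusters.values()]
    inter = mean_pairwise_distance(centers)
    intra_values = [mean_pairwise_distance(representative_points(pts, k))
                    for pts in clusters.values()]
    intra = sum(intra_values) / len(intra_values)
    return inter, intra


if __name__ == "__main__":
    # Hypothetical toy data: two well-separated square clusters.
    toy = {
        "a": [(0, 0), (0, 1), (1, 0), (1, 1)],
        "b": [(10, 10), (10, 11), (11, 10), (11, 11)],
    }
    inter, intra = evaluate(toy, k=2)
    print("inter-cluster distance:", inter)
    print("intra-cluster distance:", intra)
```

With a sketch like this, a clustering looks better when the inter-cluster distance is large relative to the intra-cluster distance, so the two numbers can be compared across runs with different k or different t1 and t2.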


On 5/15/12 4:46 PM, Pat Ferrel wrote:

So many questions about the best k, how to choose t1 and t2, and how much help dimensional reduction is would have clear answers if we had a way to judge the quality of clusters.
Various methods were discussed here for a time: http://www.lucidimagination.com/search/document/dab8c1f3c3addcfe/validating_clustering_output
Has there been any work on building a measure of quality?
