Each iteration will add a single point to the evolving list of representative points for each cluster. So, I think it depends on the number of vectors per cluster and also the intra cluster distance.
On Thu, Nov 1, 2012 at 8:13 PM, Rahul Mishra <[email protected]>wrote: > Hello Friends, > > Whats the heuristic for providing what number of iterations for > RepresentativePointsDriver? > > I have run kmeans and fuzzy-kmeans algorithm on a dataset of size 500MB. > Now, how do I obtain cluster quality? > > Does the following look Okay? : > RepresentativePointsDriver.run(conf, new Path(clustersIn), new > Path(clusteredPointsIn), new Path(outputDir), new > EuclideanDistanceMeasure(), numIterations, runSequential); > double interDis = clusterEval.interClusterDensity(); > double intraDis = clusterEval.intraClusterDensity(); > System.out.println("cluster evaluator: The inter distance: "+interDis); > System.out.println("cluster evaluator: The intra distance: "+intraDis); > > > > -- > Regards, > Rahul K Mishra, > https://sites.google.com/site/reachrahulkmishra/ >
