Hello Friends,

What is the heuristic for choosing the number of iterations for
RepresentativePointsDriver?

I have run the k-means and fuzzy k-means algorithms on a 500 MB dataset.
Now, how do I measure the quality of the resulting clusters?

Does the following look okay?
RepresentativePointsDriver.run(conf, new Path(clustersIn),
    new Path(clusteredPointsIn), new Path(outputDir),
    new EuclideanDistanceMeasure(), numIterations, runSequential);
double interDis = clusterEval.interClusterDensity();
double intraDis = clusterEval.intraClusterDensity();
System.out.println("cluster evaluator: The inter distance: " + interDis);
System.out.println("cluster evaluator: The intra distance: " + intraDis);



-- 
Regards,
Rahul K Mishra,
https://sites.google.com/site/reachrahulkmishra/
