Hello Friends,
What is the heuristic for choosing the number of iterations for
RepresentativePointsDriver?
I have run the k-means and fuzzy k-means algorithms on a 500 MB dataset.
Now, how do I measure the quality of the resulting clusters?
Does the following look okay?
RepresentativePointsDriver.run(conf, new Path(clustersIn),
    new Path(clusteredPointsIn), new Path(outputDir),
    new EuclideanDistanceMeasure(), numIterations, runSequential);
double interDis = clusterEval.interClusterDensity();
double intraDis = clusterEval.intraClusterDensity();
System.out.println("cluster evaluator: The inter distance: " + interDis);
System.out.println("cluster evaluator: The intra distance: " + intraDis);
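To make the question concrete, here is a minimal plain-Java sketch of the general idea behind an "inter" and an "intra" cluster measure. It is an illustration only and not Mahout's implementation: the class ClusterQualitySketch and its methods are hypothetical names, it uses simple mean Euclidean distances, and the values returned by interClusterDensity() and intraClusterDensity() may well be defined differently.

```java
import java.util.Arrays;
import java.util.List;

// Hypothetical illustration; does not use or reproduce any Mahout class.
public class ClusterQualitySketch {

    /** Euclidean distance between two points of equal dimension. */
    public static double euclidean(double[] a, double[] b) {
        double sum = 0.0;
        for (int i = 0; i < a.length; i++) {
            double d = a[i] - b[i];
            sum += d * d;
        }
        return Math.sqrt(sum);
    }

    /** Mean pairwise distance between cluster centers (larger = better separated). */
    public static double meanInterCenterDistance(List<double[]> centers) {
        double sum = 0.0;
        int pairs = 0;
        for (int i = 0; i < centers.size(); i++) {
            for (int j = i + 1; j < centers.size(); j++) {
                sum += euclidean(centers.get(i), centers.get(j));
                pairs++;
            }
        }
        return pairs == 0 ? 0.0 : sum / pairs;
    }

    /** Mean distance from each point to its own cluster center (smaller = tighter). */
    public static double meanIntraDistance(List<double[]> centers,
                                           List<List<double[]>> members) {
        double sum = 0.0;
        int count = 0;
        for (int c = 0; c < centers.size(); c++) {
            for (double[] p : members.get(c)) {
                sum += euclidean(centers.get(c), p);
                count++;
            }
        }
        return count == 0 ? 0.0 : sum / count;
    }

    public static void main(String[] args) {
        // Two made-up clusters in 2-D, purely for demonstration.
        List<double[]> centers = Arrays.asList(new double[] {0, 0}, new double[] {3, 4});
        List<List<double[]>> members = Arrays.asList(
            Arrays.asList(new double[] {0, 1}, new double[] {0, -1}),
            Arrays.asList(new double[] {3, 5}, new double[] {3, 3}));
        System.out.println("inter: " + meanInterCenterDistance(centers));
        System.out.println("intra: " + meanIntraDistance(centers, members));
    }
}
```

In a sketch like this, a clustering looks better when the inter value is large relative to the intra value, which is the kind of comparison I am hoping to make with the evaluator output.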
--
Regards,
Rahul K Mishra,
https://sites.google.com/site/reachrahulkmishra/