I'm trying to run canopyClustering followed by KMeansClustering Here is the code:
*Path canopyCentroids = new Path(clusteringOutputFolder, > "canopy-centroids");* * Path clusterOutput = new Path(clusteringOutputFolder, "clusters"); * > *CanopyDriver.run (inputVectors, canopyCentroids, new > TanimotoDistanceMeasure(),**0.95 ,**1.05 ,* > * true, 0.0, false); * > *KMeansDriver.run (conf, inputVectors, new > Path(canopyCentroids + "/clusters-0-final"), clusterOutput, **new > TanimotoDistanceMeasure(), 0.01, 20, true, **0,false);* Whenever i run this on my dev machine, it runs fine but when i export it to the cluster and run, i always get this error message: j > *ava.lang.RuntimeException: java.lang.RuntimeException: > java.lang.InterruptedException: Cluster Iteration 1 failed processing* > *attempt_201404230954_13426_r_000001_0: SLF4J: > See http://www.slf4j.org/codes.html#multiple_bindings > <http://www.slf4j.org/codes.html#multiple_bindings>for an explanation. * > *attempt_201404230954_13426_r_000001_0: > org.apache.mahout.math.CardinalityException: Required cardinality 370 but > got 356* > *attempt_201404230954_13426_r_000001_0: at > org.apache.mahout.math.AbstractVector.dot(AbstractVector.java:180) * > *attempt_201404230954_13426_r_000001_0: at > org.apache.mahout.common.distance.TanimotoDistanceMeasure.distance(TanimotoDistanceMeasure.java:49)* > *attempt_201404230954_13426_r_000001_0: at > org.apache.mahout.clustering.iterator.DistanceMeasureCluster.pdf(DistanceMeasureCluster.java:66) > * > *attempt_201404230954_13426_r_000001_0: at > org.apache.mahout.clustering.iterator.DistanceMeasureCluster.pdf(DistanceMeasureCluster.java:32)* > *attempt_201404230954_13426_r_000001_0: at > org.apache.mahout.clustering.iterator.AbstractClusteringPolicy.classify(AbstractClusteringPolicy.java:59) > * > *attempt_201404230954_13426_r_000001_0: at > org.apache.mahout.clustering.classify.ClusterClassifier.classify(ClusterClassifier.java:99)* > *attempt_201404230954_13426_r_000001_0: at > org.apache.mahout.clustering.iterator.CIMapper.map(CIMapper.java:36) > **attempt_201404230954_13426_r_000001_0: > at org.apache.mahout.clustering.iterator.CIMapper.map(CIMapper.java:17)* Please help. I don't know where i'm going wrong here. -- Regards- Pavan
