I think you don’t want to supply a -c argument unless you already have seed vectors in /user/netlog/upload/output4/uscensus-kmeans-centroids/part-randomSeed. Just leave it out and Mahout will use random seeds.

BTW, you’ll get help faster if you post to the user list.

On Mar 9, 2015, at 3:10 AM, Raghuveer <alwaysra...@yahoo.com.INVALID> wrote:
Hi All,

I am trying to run the following command:

./mahout kmeans -i hdfs://master:54310/user/netlog/upload/output4/tfidf-vectors/part-r-00000 -o hdfs://master:54310//user/netlog/upload/output4/tfidf-vectors-kmeans-clusters-raghuveer -c hdfs://master:54310/user/netlog/upload/output4/uscensus-kmeans-centroids -dm org.apache.mahout.common.distance.CosineDistanceMeasure -x 5 -ow -cl -k 25

and I am getting the following exception:

IllegalStateException: No input clusters found in hdfs://master:54310/user/netlog/upload/output4/uscensus-kmeans-centroids/part-randomSeed. Check your -c argument.

Kindly suggest how I can get rid of this exception.

Note: I see a vector in part-r-00000, so it is not clear to me why it says "no input".

Regards,