Indeed, for k-means clustering the sequence file input (-s) should be the path to the clusters directory that contains the binary part-00000 file, not the path to the binary file itself.
That is, use

    <path to clusters output>/clusters-<last iteration number>/

instead of

    <path to clusters output>/clusters-<last iteration number>/part-00000

clusterdump worked fine for me with the following command:

    ./bin/mahout clusterdump -s <path to clusters output>/clusters-<last iteration number>/ \
        -o <path for dump file> \
        -p <path to clusters output>/clusteredPoints/ \
        -d <path to input vectors>/dictionary.file-0 \
        -dt sequencefile

cheers,
Delroy

--
View this message in context: http://lucene.472066.n3.nabble.com/Kmeans-clustering-tp641973p853194.html
Sent from the Mahout User List mailing list archive at Nabble.com.
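P.S. If you don't want to look up the last iteration number by hand, something like the following might help. It is only a rough sketch: the function name latest_clusters_dir is made up for illustration, and it assumes the k-means output sits on the local filesystem with iteration directories named clusters-<number>. For output stored on HDFS you would need to list the directories through the Hadoop tools instead.

```shell
# Hypothetical helper (not part of Mahout): print the clusters-<N> directory
# with the highest iteration number N, with a trailing slash.
# Assumption: the output is on the local filesystem and the iteration
# directories are named clusters-<number>; anything else is skipped.
latest_clusters_dir() {
  base="$1"
  best=""
  best_n=-1
  for d in "$base"/clusters-*/; do
    [ -d "$d" ] || continue          # also skips an unmatched glob pattern
    n="${d%/}"                       # drop the trailing slash
    n="${n##*clusters-}"             # keep only the part after "clusters-"
    case "$n" in
      ''|*[!0-9]*) continue ;;       # ignore non-numeric suffixes
    esac
    if [ "$n" -gt "$best_n" ]; then
      best_n="$n"
      best="${d%/}/"
    fi
  done
  [ -n "$best" ] && printf '%s\n' "$best"
}

# Possible usage, passing the directory (not part-00000) to -s:
#   CLUSTERS_DIR=$(latest_clusters_dir "<path to clusters output>")
#   ./bin/mahout clusterdump -s "$CLUSTERS_DIR" -o <path for dump file> ...
```

The comparison is numeric on purpose, so that clusters-10 is picked over clusters-2, which a plain alphabetical sort would get wrong.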
