Mahout in general uses sequence files for input and output. These are
binary encoded files that can only be read by a compatible program. If
you are trying to e.g. less .../part-xxx then you won't see much that is
human readable. You can run the ClusterDumper to get human readable
output from running any clustering job.
On 12/27/11 3:26 PM, Keren Ouaknine wrote:
Hello,
I am running the KMeans sample:
$MAHOUT_HOME/bin/mahout
org.apache.mahout.clustering.syntheticcontrol.kmeans.Job
Following:
https://cwiki.apache.org/MAHOUT/clustering-of-synthetic-control-data.html
I uploaded the sythetic_data and this example runs fine, however I get
gibrish output while looking at the output dir.
Also, I would like to find out how many centroids are in this sample, and
what is their initial locations?
I tried to dig into the code, but src files dont seem to be included in the
distribution
Thanks,
Keren