In the folder clusters what I used there is this file: part-randomSeed
created by the command:
bin/mahout kmeans -i /home/vuvvo/reuters-out-seqdir-sparse/tfidf-vectors/ -c
/home/vuvvo/clusters -o /home/vuvvo/reuters-kmeans -k 3 --maxIter 5
I need to use the files in the folder reuters-kmeans? inside it I have got some
other sub-directory called cluster-x where x is from 1 to 5.
I tried to give the cluster-5 as input and inside finalOutput I have got a file
big 1,4 Mb but very hard to open also with 4 mb of ram on a 64 bit ^^
I can read the first and the second row:
CL-21551 {n=1855 c =[1:0.011,2:0.005...to 31:0.012
and the second row:
VL-21560{n=19722 c[0:0.012 etc etc...
is now converged and correct?
is there a more comfortable way to read this file?because than i need to convert
it in data for weka.
This night i will try to convert an irff data in a sgm to see what i obtain with
it.