Here is a quick walkthrough for doing kmeans clustering and looking at
the input and output.
https://cwiki.apache.org/confluence/display/MAHOUT/Quick+tour+of+text+analysis+using+the+Mahout+command+line
Be aware that some command line params have changed since it was written
for 0.6. For instance -s has changed to -i in some cases (as I recall).
Also clusterdump needs an output file now so will not output to the
terminal. When in doubt try the command with no params to get help.
The mahout documentation needs a bit of cleanup. Too see all the
available docs try the "view in hierarchy" format for the
cwiki.apache.org here it shows some docs not linked to in any other ways
I can find.
https://cwiki.apache.org/confluence/pages/listpages-dirview.action?key=MAHOUT&openId=74539#selectedPageInHierarchy
Also I highly recommend Mahout in Action by Manning press.
On 7/20/12 1:59 AM, Videnova, Svetlana wrote:
That's a very good question, I was expecting an answer too...
That was the answer giver to me from mahout users:
" the type of input and output depends on the job you want to run."
I was clustering .txt files for the moment.
-----Message d'origine-----
De : shriram [mailto:[email protected]]
Envoyé : vendredi 20 juillet 2012 10:52
À : [email protected]
Objet : RE: k-means output missing some cluster centers coordinates
what should be the input format for mahout??? can anybody tell me.. I'm
confused.. I'm not able to make head or tail out of the output that I'm getting
--
View this message in context:
http://lucene.472066.n3.nabble.com/k-means-output-missing-some-cluster-centers-coordinates-tp1919928p3996138.html
Sent from the Mahout User List mailing list archive at Nabble.com.
Think green - keep it on the screen.
This e-mail and any attachment is for authorised use by the intended
recipient(s) only. It may contain proprietary material, confidential
information and/or be subject to legal privilege. It should not be copied,
disclosed to, retained or used by, any other party. If you are not an intended
recipient then please promptly delete this e-mail and any attachment and all
copies and inform the sender. Thank you.