Here is a quick walkthrough for doing kmeans clustering and looking at the input and output.
https://cwiki.apache.org/confluence/display/MAHOUT/Quick+tour+of+text+analysis+using+the+Mahout+command+line
Be aware that some command line params have changed since it was written for 0.6. For instance -s has changed to -i in some cases (as I recall). Also clusterdump needs an output file now so will not output to the terminal. When in doubt try the command with no params to get help.

The mahout documentation needs a bit of cleanup. Too see all the available docs try the "view in hierarchy" format for the cwiki.apache.org here it shows some docs not linked to in any other ways I can find.
https://cwiki.apache.org/confluence/pages/listpages-dirview.action?key=MAHOUT&openId=74539#selectedPageInHierarchy

Also I highly recommend Mahout in Action by Manning press.

On 7/20/12 1:59 AM, Videnova, Svetlana wrote:
That's a very good question, I was expecting an answer too...

That was the answer giver to me from mahout users:
" the type of input and output depends on the job you want to run."

I was clustering .txt files for the moment.

-----Message d'origine-----
De : shriram [mailto:[email protected]]
Envoyé : vendredi 20 juillet 2012 10:52
À : [email protected]
Objet : RE: k-means output missing some cluster centers coordinates

what should be the input format for mahout??? can anybody tell me.. I'm 
confused.. I'm not able to make head or tail out of the output that I'm getting



--
View this message in context: 
http://lucene.472066.n3.nabble.com/k-means-output-missing-some-cluster-centers-coordinates-tp1919928p3996138.html
Sent from the Mahout User List mailing list archive at Nabble.com.


Think green - keep it on the screen.

This e-mail and any attachment is for authorised use by the intended 
recipient(s) only. It may contain proprietary material, confidential 
information and/or be subject to legal privilege. It should not be copied, 
disclosed to, retained or used by, any other party. If you are not an intended 
recipient then please promptly delete this e-mail and any attachment and all 
copies and inform the sender. Thank you.






Reply via email to