The new clustering code has several papers attached to the GitHub repo. I have also given several talks; the most detailed is the one I gave at Oxford a few weeks ago. You can get those slides from SlideShare under my name.
Mahout has a clustering interface that is best learned from the code.

On Fri, Oct 12, 2012 at 1:34 PM, Dan Filimon <[email protected]> wrote:

> Now, where do I start? What would a plan for the coming months look like?
> Should I start by first reading the theory? Learn more about Mahout?
