That'd be great. I'd say you are welcome to begin working on this within the wiki: https://cwiki.apache.org/confluence/display/MAHOUT/Mahout+Wiki
I can help with anything recommender related. For others, I'd ask the apparent author or if that doesn't work as the mailing list. On Thu, Aug 12, 2010 at 11:01 PM, Joe Kumar <[email protected]> wrote: > Hi all, > > I am a beginner wrt Mahout and am trying to learn its architecture and how > it works. This can help me to implement some ML algos for Mahout. > To understand the big picture and end-end flow of an algorithm, I am not > able to find any good documentation (I have tried searching thru google, > mailing list, mahout site..). So I am thinking of writing some documentation > so that new developers would find it easy to understand the architecture / > end-end flow and start designing / coding new algos. > Can someone please point me in the right direction as to where I can start > and what to refer etc... > > I am thinking of starting off with 1 classification (probably Naive Bayes) > and create a template for the documentation like > 1. Overview of the Algo > 2. I/P data set (how to prepare and sample data set) > 3. Maybe a sequence diagram explaining how the code flow happens (or any > other way of representing this info ??) > 4. O/P (how to read the o/p model and apply it for a real-world > classification problem) > > If you have any quick pointers on the design of Naive bayes / any info you > want added to the document template, plz let me know.. > would appreciate any guidance regarding this.. > goal : new developers can quickly ramp up and understand how an algo is > implemented so they can re-use etc effectively.. > > i understand many are already mentoring for GSOC but if someone has time to > mentor me in this effort, I'll be glad to submit a formal application > through http://community.apache.org/mentoringprogramme.html. > > thanks, > Joe. >
