On Fri, Oct 12, 2012 at 11:45 PM, Ted Dunning <[email protected]> wrote:
> Review the knn code from github
>
> File an individual contributors license agreement with Apache
>
> Change knn to fit the Mahout API
>
> Push back to Mahout
>
> Solicit current clustering users for metrics on their data (I can help with
> this)
>
> Write up data generation strategy with useable results
>
> Not sure how long these tasks are because they are a bit big for planning
> purposes, but give a decent outline.

Okay, first I'll start by looking at the problem we're solving (kNN), which approaches exist now, and which ones you implemented (i.e., I'll read the papers, presentations, and code).

As I go, I'd like to update my (somewhat unused) blog (danf.wordpress.com) to keep track of my progress and let other people know how it's working out.

As for communicating, should I ask questions on this dev@ list or e-mail you directly?

>
> On Fri, Oct 12, 2012 at 1:34 PM, Dan Filimon
> <[email protected]> wrote:
>
>> Now, where do I start? What would a plan for the coming months look like?
>>
