I think it might be worth committing in steps. The standalone clustering and utility code has almost no impact on existing Mahout code (what small impacts there were on Vector and friends were committed some time ago). These can be committed sooner.
Integration with the map-reduce and command line stuff might take a bit longer to review. This can be reviewed and committed separately. I would particularly like Shannon and Jeff's opinions about how the new clustering fits into the existing framework. There is talk of a second edition of Mahout in Action and this new clustering would be a major new capability to be covered in that so fitting in well is important. On Mon, Mar 4, 2013 at 7:02 AM, Grant Ingersoll <[email protected]> wrote: > > Where do we go from here? Do I open JIRA issues for the changes? Do I > > first merge changes to the existing Mahout classes? > > I believe there is a JIRA already open for it (if not, open one). A patch > that can be applied to trunk/master with all tests passing would be best. > Any patch that more or less shows what is done is also welcome, although > it is a bit harder to consume.
