Sure. However, the immediate contribution is data. src/main/resources? Something else?
On Sat, Jan 16, 2010 at 10:16 AM, Olivier Grisel <olivier.gri...@ensta.org> wrote: > 2010/1/16 Grant Ingersoll <gsing...@apache.org>: >> I think we should start a new module, that will be the seed for a >> subproject, called NLP and that contains the stuff for NLP. >> >> Either that or put them in the utils module, which is where I envision all >> of things that are "helpful" for ML go, but aren't required. > > +1 for an explicit "org.apache.mahout.nlp module". Tools to turn > wikipedia dumps into term freq vectors could also move there instead > of "examples". > > -- > Olivier > http://twitter.com/ogrisel - http://code.oliviergrisel.name >