2011/8/18 Jörn Kottmann <[email protected]>: > Hi all, > > the contribution from Boris contains a Porter stemmer. > > Up to now we do not have support for stemming in OpenNLP. > Should we add a component to OpenNLP which is dedicated to stemming? > > I believe that could be useful for many, and could also be useful as part of > our feature generation. To start with we might only have the Porter stemmer, > but that could easily be extended to more languages over time.
Is this better or cover more languages than what's already provided by Apache Lucene? Maybe it should better be contributed to the Lucene project and make it easy to use the generic, battle tested Lucene analyzers / tokenizers infrastructure to generate features in OpenNLP. -- Olivier http://twitter.com/ogrisel - http://github.com/ogrisel
