2011/8/18 Jörn Kottmann <[email protected]>:
> Hi all,
>
> the contribution from Boris contains a Porter stemmer.
>
> Up to now we do not have support for stemming in OpenNLP.
> Should we add a component to OpenNLP which is dedicated to stemming?
>
> I believe that could be useful for many, and could also be useful as part of
> our feature generation. To start with we might only have the Porter stemmer,
> but that could easily be extended to more languages over time.

Is this better than, or does it cover more languages than, what Apache
Lucene already provides? Maybe it would be better to contribute it to the
Lucene project and make it easy to use the generic, battle-tested Lucene
analyzer / tokenizer infrastructure to generate features in OpenNLP.

-- 
Olivier
http://twitter.com/ogrisel - http://github.com/ogrisel
