Hi all, 

I recently started working with OpenNLP for a project in the area of text 
classification with neural networks. So far, OpenNLP is a great library and 
very useful. 
There are just three things that I haven't been able to find, but maybe they do 
exist: 
- language models: e.g. to create a bigram language model with relative and 
absolute frequencies from several texts 
- stemming: to reduce different word forms in inflected languages to a 
canonical root form
- stoplist: to remove certain words (e.g. from the language model) that are 
deemed irrelevant

Do these functions exist in OpenNLP? If not, can you recommend another library 
to complement these functions? 

Kind regards, 

Martin


Attachment: signature.asc
Description: Message signed with OpenPGP using GPGMail

Reply via email to