On 24.08.2013 at 21:03, Daniel Naber <list2...@danielnaber.de> wrote:

> On 2013-08-24 20:28, Richard Eckart de Castilho wrote:
> 
>> Are you going to build a chunker from scratch or rely on existing
>> technology, e.g. the OpenNLP Chunker [1]?
> 
> I'll use the one from OpenNLP for now. It's kind of a black box for us, 
> so I'm not sure yet how to handle those cases where OpenNLP gets it 
> wrong. Any ideas about that?

I'm not familiar with its internals, but since it can be trained, a good
approach would probably be to start collecting the sentences it gets wrong,
correct them manually, and retrain every once in a while on the original
corpus plus the corrected samples.
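
A rough sketch of the merging step, in case it helps. It assumes the training
data is in the CoNLL-2000 style format (one "word POS chunk-tag" triple per
line, blank line between sentences); the function names are made up for
illustration and are not part of OpenNLP. A corrected sentence replaces any
original sentence with the same tokens, so the wrong annotation does not stay
in the training set.

```python
def read_sentences(text):
    """Split CoNLL-2000 style text into sentences (lists of token lines)."""
    sentences, current = [], []
    for line in text.splitlines():
        line = line.strip()
        if line:
            current.append(line)
        elif current:
            # A blank line ends the current sentence.
            sentences.append(current)
            current = []
    if current:
        sentences.append(current)
    return sentences


def tokens_of(sentence):
    """Return the word column of a sentence, used as its identity."""
    return tuple(line.split()[0] for line in sentence)


def merge_corpora(original_text, corrected_text):
    """Combine the original corpus with manually corrected samples.

    Original sentences whose tokens match a corrected sentence are dropped,
    then all corrected sentences are appended.
    """
    corrected = read_sentences(corrected_text)
    corrected_keys = {tokens_of(s) for s in corrected}
    kept = [s for s in read_sentences(original_text)
            if tokens_of(s) not in corrected_keys]
    merged = kept + corrected
    return "\n\n".join("\n".join(s) for s in merged) + "\n"
```

The merged text could then be written to a file and passed to whatever
trainer is in use (for OpenNLP, presumably its chunker trainer), so that each
retraining run sees the original corpus plus all corrections collected so far.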

-- Richard
_______________________________________________
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel
