Re: OpenNLP 1.5.3 ....

jim.foobar Thu, 03 May 2012 04:35:38 -0700

On 03/05/12 12:16, Jörn Kottmann wrote:

On 05/03/2012 10:58 AM, Jim - FooBar(); wrote:
I can also provide the "AggregateNameFinder" class, which takes any number of name-finders and merges their results in order to get better evaluation statistics. Internally, it uses the "NameFinderME.dropOverlappingSpans()" method to get rid of nested spans, which however does the simplistic thing of keeping the earliest span (ignoring the type of the span completely). I think being able to merge results from several name-finders is a killer feature that a lot of people will appreciate, even if I don't think keeping the earliest span is sensible when trying to evaluate several finders on multiple entity types...
+1 to implement it based on NameFinderME.dropOverlappingSpans.
In my opinion that is still a good baseline. We can come up with more specialized and sophisticated approaches, e.g. based on probabilities and limited to statistical name finders.
Jörn
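
For readers following the thread, here is a rough sketch of the baseline being discussed: collect the spans from every finder, then drop overlaps by keeping whichever span starts first, without looking at its type. It does not use the OpenNLP classes; the Span and Finder types and the findAll and dropOverlaps helpers below are hypothetical stand-ins invented for illustration, and the tie-break (longer span first when two spans start at the same token) is an assumption.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class AggregateSketch {

    /** Hypothetical stand-in for a name span: token offsets [start, end) plus an entity type. */
    public static final class Span {
        public final int start, end;
        public final String type;

        public Span(int start, int end, String type) {
            this.start = start;
            this.end = end;
            this.type = type;
        }

        boolean overlaps(Span o) {
            return start < o.end && o.start < end;
        }

        @Override
        public String toString() {
            return "[" + start + ".." + end + ") " + type;
        }
    }

    /** Hypothetical minimal finder interface (not the OpenNLP one). */
    public interface Finder {
        List<Span> find(String[] tokens);
    }

    /** Keeps the earliest span of every overlapping group; the span type is ignored. */
    public static List<Span> dropOverlaps(List<Span> spans) {
        List<Span> sorted = new ArrayList<>(spans);
        // Earliest start first; on equal starts the longer span comes first (assumption).
        sorted.sort((a, b) -> a.start != b.start
                ? Integer.compare(a.start, b.start)
                : Integer.compare(b.end, a.end));
        List<Span> kept = new ArrayList<>();
        Span last = null;
        for (Span s : sorted) {
            if (last == null || !last.overlaps(s)) {
                kept.add(s);
                last = s;
            }
        }
        return kept;
    }

    /** Runs every finder on the same tokens and merges the results. */
    public static List<Span> findAll(String[] tokens, List<Finder> finders) {
        List<Span> all = new ArrayList<>();
        for (Finder f : finders) {
            all.addAll(f.find(tokens));
        }
        return dropOverlaps(all);
    }

    public static void main(String[] args) {
        String[] tokens = {"a", "b", "c", "d", "e", "f", "g"};
        // Two made-up finders with fixed output, standing in for a dictionary and a model.
        Finder dictionary = t -> Arrays.asList(new Span(2, 4, "person"));
        Finder model = t -> Arrays.asList(new Span(1, 4, "location"), new Span(5, 7, "location"));
        for (Span s : findAll(tokens, Arrays.asList(dictionary, model))) {
            System.out.println(s);
        }
    }
}
```

In this made-up input the dictionary's "person" span at tokens 2..4 is dropped because the model's "location" span starts one token earlier, which is the kind of outcome the quoted message objects to.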

Yes, I agree it is not a bad baseline, but pretty soon we'll have to either look at the probabilities (if someone is trying to merge several models) or at the actual class of the name-finder that gave a particular prediction, and reason on that... For example, if a prediction came from a dictionary, there is really no point in doubting it, is there? It must be correct! Anyway, I'd love to see this feature in 1.5.3, and a couple of weeks (what William needs) is not that long...

Jim

ps: btw, I've actually been using the aggregate name-finder in my private build for almost 3 weeks now... I'm passing it 2 dictionary finders of different types and a maxent model that can also predict 2 types. Everything works just fine! :)
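
To make the "reason on the source of the prediction" idea above a bit more concrete, here is a minimal sketch of one possible overlap policy: dictionary hits always win, statistical predictions are ranked by probability, and the earliest start is only a tie-break. This is not what the aggregate name-finder in my build does (that one uses the baseline); the Prediction class, its fields, the merge method and the entity types are all hypothetical names made up for this example.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;

public class PriorityMergeSketch {

    /** Hypothetical prediction: a span plus information about which kind of finder produced it. */
    public static final class Prediction {
        public final int start, end;          // token offsets, end exclusive
        public final String type;             // entity type (made up for the example)
        public final boolean fromDictionary;  // true if a dictionary finder produced it
        public final double prob;             // model confidence; ignored for dictionary hits

        public Prediction(int start, int end, String type, boolean fromDictionary, double prob) {
            this.start = start;
            this.end = end;
            this.type = type;
            this.fromDictionary = fromDictionary;
            this.prob = prob;
        }

        boolean overlaps(Prediction o) {
            return start < o.end && o.start < end;
        }

        @Override
        public String toString() {
            return "[" + start + ".." + end + ") " + type
                    + (fromDictionary ? " (dictionary)" : " (model, p=" + prob + ")");
        }
    }

    /** Greedy merge: visit predictions from most to least trusted, skip any that clash. */
    public static List<Prediction> merge(List<Prediction> all) {
        List<Prediction> ranked = new ArrayList<>(all);
        ranked.sort((a, b) -> {
            // Dictionary hits outrank statistical predictions.
            if (a.fromDictionary != b.fromDictionary) return a.fromDictionary ? -1 : 1;
            // Among statistical predictions, higher probability first.
            if (!a.fromDictionary && a.prob != b.prob) return Double.compare(b.prob, a.prob);
            // Otherwise fall back to the earliest start.
            return Integer.compare(a.start, b.start);
        });
        List<Prediction> kept = new ArrayList<>();
        for (Prediction p : ranked) {
            boolean clash = false;
            for (Prediction k : kept) {
                if (k.overlaps(p)) {
                    clash = true;
                    break;
                }
            }
            if (!clash) kept.add(p);
        }
        // Return the survivors in document order.
        kept.sort(Comparator.comparingInt((Prediction p) -> p.start));
        return kept;
    }

    public static void main(String[] args) {
        List<Prediction> all = new ArrayList<>();
        all.add(new Prediction(0, 3, "location", false, 0.55));
        all.add(new Prediction(1, 3, "person", true, 1.0));
        all.add(new Prediction(5, 6, "person", false, 0.90));
        all.add(new Prediction(5, 7, "location", false, 0.60));
        for (Prediction p : merge(all)) {
            System.out.println(p);
        }
    }
}
```

With this made-up input the dictionary hit at tokens 1..3 and the 0.90 model prediction at tokens 5..6 survive, whereas keeping the earliest span would have kept the two lower-confidence "location" spans instead. Whether a dictionary hit really deserves unconditional trust is of course a design choice.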
