[ 
https://issues.apache.org/jira/browse/OPENNLP-1306?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17510553#comment-17510553
 ] 

ASF GitHub Bot commented on OPENNLP-1306:
-----------------------------------------

jzonthemtn merged pull request #406:
URL: https://github.com/apache/opennlp/pull/406


   



> NameSample overlap exception not helpful
> ----------------------------------------
>
>                 Key: OPENNLP-1306
>                 URL: https://issues.apache.org/jira/browse/OPENNLP-1306
>             Project: OpenNLP
>          Issue Type: Improvement
>          Components: Name Finder
>    Affects Versions: 1.9.2
>            Reporter: Markus Jelsma
>            Priority: Major
>         Attachments: OPENNLP-1306.patch
>
>
> I got the following exception while training on a very large training file.
> {code:java}
>          Computing event counts...  Exception in thread "main" java.lang.RuntimeException: name spans [27..29) person and [27..27) person are overlapped in file: null
>         at opennlp.tools.namefind.NameSample.<init>(NameSample.java:79)
>         at opennlp.tools.namefind.NameSample.<init>(NameSample.java:97)
>         at opennlp.tools.namefind.NameSample.<init>(NameSample.java:101)
> {code}
> With this exception it is impossible to track down the error in a large 
> training file: the message reports only the span offsets and "file: null".
>  
> Exceptions about mismatching <START:> and <END> tags at least give some 
> context. This patch adds the sentence tokens to the exception message, making 
> it simple to grep the training file for the bad sentence.
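
The idea described in the issue can be sketched roughly as follows. This is only an illustration, not the attached patch and not the actual `NameSample` code: the `OverlapCheck` class, its nested `Span` type, and the `check` method are hypothetical stand-ins, and the exact wording of the message is an assumption. It shows an overlap check that appends the sentence tokens to the exception message so the offending sentence can be found with grep.

```java
import java.util.Arrays;
import java.util.Comparator;

// Hypothetical sketch; not the OpenNLP implementation or the attached patch.
class OverlapCheck {

    /** Stand-in for a typed name span: half-open token range [start, end). */
    static final class Span {
        final int start;
        final int end;
        final String type;

        Span(int start, int end, String type) {
            this.start = start;
            this.end = end;
            this.type = type;
        }

        @Override
        public String toString() {
            return "[" + start + ".." + end + ") " + type;
        }
    }

    /**
     * Throws if any two name spans overlap. The sentence tokens are appended
     * to the message so the bad sentence can be located in the training file.
     */
    static void check(String[] sentence, Span[] names) {
        // Sort a copy by start index so overlapping spans end up adjacent.
        Span[] sorted = names.clone();
        Arrays.sort(sorted, Comparator.comparingInt((Span s) -> s.start));

        for (int i = 1; i < sorted.length; i++) {
            // Overlap: the previous span ends after the current one starts.
            if (sorted[i - 1].end > sorted[i].start) {
                throw new RuntimeException("name spans " + sorted[i - 1] + " and "
                        + sorted[i] + " are overlapped in sentence: "
                        + String.join(" ", sentence));
            }
        }
    }

    public static void main(String[] args) {
        String[] sentence = {"Alice", "Smith", "visited", "Paris"};
        Span[] names = {new Span(0, 2, "person"), new Span(1, 2, "person")};
        try {
            check(sentence, names);
        } catch (RuntimeException e) {
            // The message now carries the sentence text, which can be grepped.
            System.out.println(e.getMessage());
        }
    }
}
```

With a message of this shape, a search of the training file for the printed tokens (ignoring any `<START:...>` and `<END>` markup between them) should narrow the problem down to the offending line.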



