[
https://issues.apache.org/jira/browse/OPENNLP-1306?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17509943#comment-17509943
]
ASF GitHub Bot commented on OPENNLP-1306:
-----------------------------------------
jzonthemtn commented on pull request #406:
URL: https://github.com/apache/opennlp/pull/406#issuecomment-1074024891
This is a pull request from a community submitted patch file on
[OPENNLP-1306](https://issues.apache.org/jira/projects/OPENNLP/issues/OPENNLP-1306)
to add the sentence to the name finder training error message to provide more
context about the problem.
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]
> NameSample overlap exception not helpful
> ----------------------------------------
>
> Key: OPENNLP-1306
> URL: https://issues.apache.org/jira/browse/OPENNLP-1306
> Project: OpenNLP
> Issue Type: Improvement
> Components: Name Finder
> Affects Versions: 1.9.2
> Reporter: Markus Jelsma
> Priority: Major
> Attachments: OPENNLP-1306.patch
>
>
> I got this for some very large training file.
> {code:java}
> Computing event counts... Exception in thread "main"
> java.lang.RuntimeException: name spans [27..29) person and [27..27) person
> are overlapped in file: null
> at opennlp.tools.namefind.NameSample.<init>(NameSample.java:79)
> at opennlp.tools.namefind.NameSample.<init>(NameSample.java:97)
> at opennlp.tools.namefind.NameSample.<init>(NameSample.java:101)
> {code}
> With this exception it is impossible to track the error if you have a large
> training file.
>
> Exceptions about mismatching <START:> and <END> tags at least give a little
> bit of context. This patch adds the sentence parts to the exception, making
> it simple to grep the training file for the bad sentence.
--
This message was sent by Atlassian Jira
(v8.20.1#820001)