Markus Jelsma created OPENNLP-1306:
--------------------------------------
Summary: NameSample overlap exception not helpful
Key: OPENNLP-1306
URL: https://issues.apache.org/jira/browse/OPENNLP-1306
Project: OpenNLP
Issue Type: Improvement
Components: Name Finder
Affects Versions: 1.9.2
Reporter: Markus Jelsma
I got this for some very large training file.
{code:java}
Computing event counts... Exception in thread "main"
java.lang.RuntimeException: name spans [27..29) person and [27..27) person are
overlapped in file: null
at opennlp.tools.namefind.NameSample.<init>(NameSample.java:79)
at opennlp.tools.namefind.NameSample.<init>(NameSample.java:97)
at opennlp.tools.namefind.NameSample.<init>(NameSample.java:101)
{code}
With this exception it is impossible to track the error if you have a large
training file.
Exceptions about mismatching <START:> and <END> tags at least give a little bit
of context. This patch adds the sentence parts to the exception, making it
simple to grep the training file for the bad sentence.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)