Ioan Barbulescu created OPENNLP-597:
---------------------------------------
Summary: Code in tools/parser throws some NullPointerExceptions
when dealing with poor training data
Key: OPENNLP-597
URL: https://issues.apache.org/jira/browse/OPENNLP-597
Project: OpenNLP
Issue Type: Bug
Components: Parser
Affects Versions: tools-1.5.3
Environment: Windows 7 + java 1.7.0_21
Reporter: Ioan Barbulescu
Priority: Minor
Fix For: 1.6.0
I was trying to train the Treebank Parser with some new data.
Truth to be told, the data was in poor format. Specifically, instead of "(-RRB-
-RRB-)", it contained "( -RRB-)".
The same for -LRB- constructions.
Due to this input data, the parsing code was throwing some NullPointerException
errors.
The fixes consist in some supplementary "if()"s, to safeguard against null
pointers.
Fixes are in 3 files, attached as diff. The diff was created by svn, run in the
opennlp-tool/.../parser directory.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira