[ 
https://issues.apache.org/jira/browse/OPENNLP-597?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13771148#comment-13771148
 ] 

Joern Kottmann commented on OPENNLP-597:
----------------------------------------

Can you attach a sentence of your training data which shows this problem? If 
the data format does not match the expected format we might instead want to 
make it fail in a meaningful way.
                
> Code in tools/parser throws some NullPointerExceptions when dealing with poor 
> training data
> -------------------------------------------------------------------------------------------
>
>                 Key: OPENNLP-597
>                 URL: https://issues.apache.org/jira/browse/OPENNLP-597
>             Project: OpenNLP
>          Issue Type: Bug
>          Components: Parser
>    Affects Versions: tools-1.5.3
>         Environment: Windows 7 + java 1.7.0_21 
>            Reporter: Ioan Barbulescu
>            Priority: Minor
>             Fix For: 1.6.0
>
>         Attachments: tools.patch
>
>
> I was trying to train the Treebank Parser with some new data.
> Truth to be told, the data was in poor format. Specifically, instead of 
> "(-RRB- -RRB-)", it contained "( -RRB-)".
> The same for -LRB- constructions.
> Due to this input data, the parsing code was throwing some 
> NullPointerException errors.
> The fixes consist in some supplementary "if()"s, to safeguard against null 
> pointers.
> Fixes are in 3 files, attached as diff. The diff was created by svn, run in 
> the opennlp-tool/.../parser directory.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to