[
https://issues.apache.org/jira/browse/OPENNLP-1078?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16023704#comment-16023704
]
Jim Regan commented on OPENNLP-1078:
------------------------------------
OK, I've run a quick Perl script over the whole sentence bank, and a fairly
significant number of sentences have more tokens than lemmas. I'll put in a
check to skip those tomorrow, and work with the creator to fix the errors in
the original.
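
For illustration only, the kind of check described above might look roughly
like the sketch below. It is not the actual patch: the Entry class, its
fields, and the method names are hypothetical stand-ins, and the real
IrishSentenceBankDocument code may represent tokens and lemmas quite
differently. The only rule taken from the comment is that a sentence with
more tokens than lemmas gets skipped.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

class LemmaCountCheck {

  // Hypothetical stand-in for one sentence-bank entry (not an OpenNLP class).
  static final class Entry {
    final List<String> tokens;
    final List<String> lemmas;

    Entry(List<String> tokens, List<String> lemmas) {
      this.tokens = tokens;
      this.lemmas = lemmas;
    }
  }

  // True when every token can be paired with a lemma, i.e. the entry
  // does not have more tokens than lemmas.
  static boolean isConsistent(Entry e) {
    return e.tokens != null && e.lemmas != null
        && e.tokens.size() <= e.lemmas.size();
  }

  // Returns only the consistent entries; the others are skipped
  // rather than failing later during surface-form/lemma matching.
  static List<Entry> skipInconsistent(List<Entry> entries) {
    List<Entry> kept = new ArrayList<>();
    for (Entry e : entries) {
      if (isConsistent(e)) {
        kept.add(e);
      }
    }
    return kept;
  }

  public static void main(String[] args) {
    // Made-up placeholder data, not taken from the sentence bank.
    List<Entry> entries = Arrays.asList(
        new Entry(Arrays.asList("tok1", "tok2"), Arrays.asList("lem1", "lem2")),
        new Entry(Arrays.asList("tok1", "tok2", "tok3"), Arrays.asList("lem1")));
    System.out.println("kept " + skipInconsistent(entries).size()
        + " of " + entries.size() + " entries");
  }
}
```

Skipping is only a stopgap until the source data is corrected; logging the
skipped sentence identifiers would make it easier to report them upstream.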
> Irish Sentence Bank format support throws NPE
> ---------------------------------------------
>
> Key: OPENNLP-1078
> URL: https://issues.apache.org/jira/browse/OPENNLP-1078
> Project: OpenNLP
> Issue Type: Bug
> Components: Formats
> Reporter: Joern Kottmann
> Priority: Minor
> Fix For: 1.8.1
>
>
> It happens when trying to match the surface form with its lemma, on line 245
> of IrishSentenceBankDocument.java:
> int rsize = flx.get(flexidx).size();
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)