[
https://issues.apache.org/jira/browse/OPENNLP-676?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Hugo Mougard updated OPENNLP-676:
---------------------------------
Description:
The AnnotationComboIterator helper class used by the UIMA POSTagger uses both
UIMA & Java style iterators. This is strongly discouraged:
https://uima.apache.org/downloads/releaseDocs/2.3.0-incubating/docs/api/org/apache/uima/cas/FSIterator.html
The consequence is that the AE breaks even on very simple CASes such as the CAS
showcased on this repository (text of 9 letters, 2 sentence annotations and 9
token annotations):
https://github.com/m09/postagger-iterator-bug/blob/master/in.xmi
The repository linked above contains a example program that crashes on my
setup. It's fully maven 3 aware so you can normally launch it quite easily.
was:
The AnnotationComboIterator helper class used by the UIMA POSTagger uses both
UIMA & Java style iterators. This is strongly discouraged:
https://uima.apache.org/downloads/releaseDocs/2.3.0-incubating/docs/api/org/apache/uima/cas/FSIterator.html
The consequence is that the AE breaks even on very simple CASes such as the CAS
showcased on this repository (text of 9 letters, 2 sentence annotations and 9
token annotations): https://github.com/m09/postagger-iterator-bug
The repository linked above contains a example program that crashes on my
setup. It's fully maven 3 aware so you can normally launch it quite easily.
> POSTagger UIMA AE broken because of AnnotationComboIterator
> -----------------------------------------------------------
>
> Key: OPENNLP-676
> URL: https://issues.apache.org/jira/browse/OPENNLP-676
> Project: OpenNLP
> Issue Type: Bug
> Components: POS Tagger, UIMA Integration
> Affects Versions: tools-1.5.3
> Environment: Oracle JDK8, Debian Jessie 64b
> Reporter: Hugo Mougard
> Priority: Critical
>
> The AnnotationComboIterator helper class used by the UIMA POSTagger uses both
> UIMA & Java style iterators. This is strongly discouraged:
> https://uima.apache.org/downloads/releaseDocs/2.3.0-incubating/docs/api/org/apache/uima/cas/FSIterator.html
> The consequence is that the AE breaks even on very simple CASes such as the
> CAS showcased on this repository (text of 9 letters, 2 sentence annotations
> and 9 token annotations):
> https://github.com/m09/postagger-iterator-bug/blob/master/in.xmi
> The repository linked above contains a example program that crashes on my
> setup. It's fully maven 3 aware so you can normally launch it quite easily.
--
This message was sent by Atlassian JIRA
(v6.2#6252)