[ 
https://issues.apache.org/jira/browse/OPENNLP-676?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Hugo Mougard updated OPENNLP-676:
---------------------------------

    Description: 
The AnnotationComboIterator helper class used by the UIMA POSTagger accesses 
its iterators unsafely.

The consequence is that the AE breaks even on very simple CASes such as the CAS 
showcased on this repository (text of 9 letters, 2 sentence annotations and 9 
token annotations): 
https://github.com/m09/postagger-iterator-bug/blob/master/in.xmi

The repository linked above contains an example program that crashes on my 
setup. It's fully maven 3 aware so you can normally launch it quite easily.

  was:
The AnnotationComboIterator helper class used by the UIMA POSTagger uses both 
UIMA & Java style iterators. This is strongly discouraged: 
https://uima.apache.org/downloads/releaseDocs/2.3.0-incubating/docs/api/org/apache/uima/cas/FSIterator.html

The consequence is that the AE breaks even on very simple CASes such as the CAS 
showcased on this repository (text of 9 letters, 2 sentence annotations and 9 
token annotations): 
https://github.com/m09/postagger-iterator-bug/blob/master/in.xmi

The repository linked above contains an example program that crashes on my 
setup. It's fully maven 3 aware so you can normally launch it quite easily.


> POSTagger UIMA AE broken because of AnnotationComboIterator
> -----------------------------------------------------------
>
>                 Key: OPENNLP-676
>                 URL: https://issues.apache.org/jira/browse/OPENNLP-676
>             Project: OpenNLP
>          Issue Type: Bug
>          Components: POS Tagger, UIMA Integration
>    Affects Versions: tools-1.5.3
>         Environment: Oracle JDK8, Debian Jessie 64b
>            Reporter: Hugo Mougard
>
> The AnnotationComboIterator helper class used by the UIMA POSTagger accesses 
> its iterators unsafely.
> The consequence is that the AE breaks even on very simple CASes such as the 
> CAS showcased on this repository (text of 9 letters, 2 sentence annotations 
> and 9 token annotations): 
> https://github.com/m09/postagger-iterator-bug/blob/master/in.xmi
> The repository linked above contains an example program that crashes on my 
> setup. It's fully maven 3 aware so you can normally launch it quite easily.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to