[ 
https://issues.apache.org/jira/browse/OPENNLP-676?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Hugo Mougard updated OPENNLP-676:
---------------------------------

    Description: 
The AnnotationComboIterator helper class used by the UIMA POSTagger uses both
UIMA-style and Java-style iterators. This is strongly discouraged:
https://uima.apache.org/downloads/releaseDocs/2.3.0-incubating/docs/api/org/apache/uima/cas/FSIterator.html
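
To illustrate the general hazard, here is a minimal sketch in plain Java. It
is not the actual AnnotationComboIterator code: CursorIterator and
MixedIterationDemo are hypothetical names, and the class only imitates an
iterator that, like FSIterator, exposes both a cursor API (isValid / get /
moveToNext) and the java.util.Iterator API (hasNext / next). The assumption is
that next() returns the current element and then advances the cursor, so a
get() issued after next() reads the following element.

```java
import java.util.ArrayList;
import java.util.Iterator;
import java.util.List;
import java.util.NoSuchElementException;

// Hypothetical stand-in for an iterator offering two iteration styles.
// It is NOT the UIMA FSIterator class, only an imitation of its shape.
final class CursorIterator<T> implements Iterator<T> {
    private final List<T> items;
    private int position = 0;

    CursorIterator(List<T> items) {
        this.items = items;
    }

    // Cursor style: check validity, read the current element, then move.
    boolean isValid() {
        return position < items.size();
    }

    T get() {
        if (!isValid()) {
            throw new NoSuchElementException();
        }
        return items.get(position);
    }

    void moveToNext() {
        position++;
    }

    // Java style: next() returns the current element AND advances.
    @Override
    public boolean hasNext() {
        return isValid();
    }

    @Override
    public T next() {
        T current = get();
        moveToNext();
        return current;
    }
}

final class MixedIterationDemo {
    // Consistent cursor-style loop: visits every element once.
    static List<String> cursorStyle(List<String> tokens) {
        List<String> seen = new ArrayList<>();
        CursorIterator<String> it = new CursorIterator<>(tokens);
        while (it.isValid()) {
            seen.add(it.get());
            it.moveToNext();
        }
        return seen;
    }

    // Mixed loop: next() has already advanced the cursor, so get()
    // reads the element after the one next() returned.
    static List<String> mixedStyle(List<String> tokens) {
        List<String> seen = new ArrayList<>();
        CursorIterator<String> it = new CursorIterator<>(tokens);
        while (it.hasNext()) {
            it.next();
            if (it.isValid()) {
                seen.add(it.get());
            }
        }
        return seen;
    }

    public static void main(String[] args) {
        List<String> tokens = List.of("a", "b", "c");
        System.out.println(cursorStyle(tokens)); // [a, b, c]
        System.out.println(mixedStyle(tokens));  // [b, c]
    }
}
```

In this sketch the mixed loop silently skips the first element. Without the
isValid() guard it would throw NoSuchElementException at the end of the list.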

As a consequence, the AE breaks even on very simple CASes, such as the one in
this repository (a 9-letter text with 2 sentence annotations and 9 token
annotations):
https://github.com/m09/postagger-iterator-bug/blob/master/in.xmi

The repository linked above contains an example program that crashes on my
setup. It is a standard Maven 3 project, so it should be easy to build and run.

  was:
The AnnotationComboIterator helper class used by the UIMA POSTagger uses both 
UIMA & Java style iterators. This is strongly discouraged: 
https://uima.apache.org/downloads/releaseDocs/2.3.0-incubating/docs/api/org/apache/uima/cas/FSIterator.html

The consequence is that the AE breaks even on very simple CASes such as the CAS 
showcased on this repository (text of 9 letters, 2 sentence annotations and 9 
token annotations): 
https://github.com/m09/postagger-iterator-bug/blob/master/in.xmi

The repository linked above contains a example program that crashes on my 
setup. It's fully maven 3 aware so you can normally launch it quite easily.


> POSTagger UIMA AE broken because of AnnotationComboIterator
> -----------------------------------------------------------
>
>                 Key: OPENNLP-676
>                 URL: https://issues.apache.org/jira/browse/OPENNLP-676
>             Project: OpenNLP
>          Issue Type: Bug
>          Components: POS Tagger, UIMA Integration
>    Affects Versions: tools-1.5.3
>         Environment: Oracle JDK8, Debian Jessie 64b
>            Reporter: Hugo Mougard
>            Priority: Critical
>
> The AnnotationComboIterator helper class used by the UIMA POSTagger uses both
> UIMA-style and Java-style iterators. This is strongly discouraged:
> https://uima.apache.org/downloads/releaseDocs/2.3.0-incubating/docs/api/org/apache/uima/cas/FSIterator.html
> As a consequence, the AE breaks even on very simple CASes, such as the one in
> this repository (a 9-letter text with 2 sentence annotations and 9 token
> annotations):
> https://github.com/m09/postagger-iterator-bug/blob/master/in.xmi
> The repository linked above contains an example program that crashes on my
> setup. It is a standard Maven 3 project, so it should be easy to build and run.



--
This message was sent by Atlassian JIRA
(v6.2#6252)
