Hugo Mougard created OPENNLP-676:
------------------------------------
Summary: POSTagger UIMA AE broken because of
AnnotationComboIterator
Key: OPENNLP-676
URL: https://issues.apache.org/jira/browse/OPENNLP-676
Project: OpenNLP
Issue Type: Bug
Components: POS Tagger, UIMA Integration
Affects Versions: tools-1.5.3
Environment: Oracle JDK8, Debian Jessie 64b
Reporter: Hugo Mougard
Priority: Critical
The AnnotationComboIterator helper class used by the UIMA POSTagger uses both
UIMA & Java style iterators. This is strongly discouraged:
https://uima.apache.org/downloads/releaseDocs/2.3.0-incubating/docs/api/org/apache/uima/cas/FSIterator.html
The consequence is that the AE breaks even on very simple CASes such as the CAS
showcased on this repository (text of 9 letters, 2 sentence annotations and 9
token annotations): https://github.com/m09/postagger-iterator-bug
The repository linked above contains a example program that crashes on my
setup. It's fully maven 3 aware so you can normally launch it quite easily.
--
This message was sent by Atlassian JIRA
(v6.2#6252)