Create reusable sequence detection classes
------------------------------------------
Key: OPENNLP-102
URL: https://issues.apache.org/jira/browse/OPENNLP-102
Project: OpenNLP
Issue Type: Improvement
Reporter: Jörn Kottmann
Priority: Minor
There is a common pattern that in many places a piece of code needs to detect a
BIO sequences and create spans out of these. It turned out that writing this
code is error prone and testing it with unit tests is complex.
We should create reusable classes which can perform the sequence detection and
can be used in all the places where it is required.
Having common code also makes difficult to do performance optimizations
possible, like mapping via perfect hashing instead of string parsing or regular
expressions.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.