[ https://issues.apache.org/jira/browse/OPENNLP-53?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Martin Wiesner updated OPENNLP-53:
----------------------------------
    Fix Version/s: 2.6.0

> Parser should have simple interface to process a tokenized input sentence
> -------------------------------------------------------------------------
>
>                 Key: OPENNLP-53
>                 URL: https://issues.apache.org/jira/browse/OPENNLP-53
>             Project: OpenNLP
>          Issue Type: Improvement
>          Components: Parser
>            Reporter: Jörn Kottmann
>            Priority: Major
>             Fix For: 2.6.0
>
>
> The parser expects a tokenized sentence as input, but currently the sentence
> must be converted to a string in which the tokens are separated by whitespace.
> This interface turned out to be inconvenient if the input sentence is
> provided as a list of strings or as a string with a token span list. In both
> cases a new string must be created, and the offsets of the individual tokens
> in this new string must be remembered in order to retrieve the parse tree
> from the Parse objects.
> Create a more convenient way of interacting with an already tokenized
> sentence that is not in a whitespace-separated format.
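
To illustrate the workaround the issue describes, here is a minimal sketch in plain Java of what a caller has to do when the input is a list of tokens: build the whitespace-separated string and record where each token landed in it. The class and method names (`TokenJoinSketch`, `JoinedSentence`, `join`) are hypothetical and are not part of OpenNLP; the sketch uses no OpenNLP calls and only shows the bookkeeping that a more convenient parser interface would make unnecessary.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class TokenJoinSketch {

    /** Hypothetical holder: the joined sentence plus [start, end) offsets per token. */
    static final class JoinedSentence {
        final String text;
        final List<int[]> offsets;

        JoinedSentence(String text, List<int[]> offsets) {
            this.text = text;
            this.offsets = offsets;
        }
    }

    /** Joins tokens with single spaces and remembers each token's character offsets. */
    static JoinedSentence join(List<String> tokens) {
        StringBuilder sb = new StringBuilder();
        List<int[]> offsets = new ArrayList<>();
        for (String token : tokens) {
            if (sb.length() > 0) {
                sb.append(' '); // separator between tokens, not part of any token
            }
            int start = sb.length();
            sb.append(token);
            offsets.add(new int[] {start, sb.length()}); // end offset is exclusive
        }
        return new JoinedSentence(sb.toString(), offsets);
    }

    public static void main(String[] args) {
        JoinedSentence s = join(Arrays.asList("The", "quick", "fox", "."));
        System.out.println(s.text);
        for (int[] o : s.offsets) {
            // Each offset pair maps back to the original token in the joined string.
            System.out.println(o[0] + "-" + o[1] + ": " + s.text.substring(o[0], o[1]));
        }
    }
}
```

The recorded offsets are what the caller would later need in order to match nodes of the resulting parse tree back to the original tokens.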