Parser should have simple interface to process a tokenized input sentence
-------------------------------------------------------------------------

                 Key: OPENNLP-53
                 URL: https://issues.apache.org/jira/browse/OPENNLP-53
             Project: OpenNLP
          Issue Type: Improvement
          Components: Parser
            Reporter: Jörn Kottmann


The parser expects a tokenized sentence as input, but currently it must be 
converted to a string where each
token is separated by a white space.

This interface turned out to be inconvenient if the input if the input sentence 
is
provided as a list of strings or a string with a token span list. In both case
a new string must be created. In this new string the offsets of the individual 
tokens
must be remember in order to retrieve the parse tree out of the Parse objects.

Create a more convenient way of interacting with an already tokenized sentence 
which
is not in a whitespace separated format. 

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to