[ 
https://issues.apache.org/jira/browse/OPENNLP-53?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Martin Wiesner updated OPENNLP-53:
----------------------------------
    Fix Version/s: 2.6.0

> Parser should have simple interface to process a tokenized input sentence
> -------------------------------------------------------------------------
>
>                 Key: OPENNLP-53
>                 URL: https://issues.apache.org/jira/browse/OPENNLP-53
>             Project: OpenNLP
>          Issue Type: Improvement
>          Components: Parser
>            Reporter: Jörn Kottmann
>            Priority: Major
>             Fix For: 2.6.0
>
>
> The parser expects a tokenized sentence as input, but currently it must be 
> converted to a string where each
> token is separated by a white space.
> This interface turned out to be inconvenient if the input if the input 
> sentence is
> provided as a list of strings or a string with a token span list. In both case
> a new string must be created. In this new string the offsets of the 
> individual tokens
> must be remember in order to retrieve the parse tree out of the Parse objects.
> Create a more convenient way of interacting with an already tokenized 
> sentence which
> is not in a whitespace separated format. 



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to