On Sun, 2017-01-15 at 02:42 +0100, Richard Eckart de Castilho wrote:
> On 14.01.2017, at 20:54, Joern Kottmann <kottm...@gmail.com> wrote:
> >
> > You can do that, we have a rule-based detokenizer which can be used
> > to produce training data from tokenized input.
> >
> > Have a look at the detokenizer in the tokenizer package.
>
> However, do you have any evaluation of the detokenizer?
>
I opened an issue to add that:
https://issues.apache.org/jira/browse/OPENNLP-941

Jörn