[
https://issues.apache.org/jira/browse/OPENNLP-225?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13068783#comment-13068783
]
William Colen commented on OPENNLP-225:
---------------------------------------
I commited the initial code for it, but there is one issue I could not figure
out how to solve:
One can create a AbbreviationDictionary from a serialized file passing the
stream and a case sensitivity flag. But how will it work while loading the
dictionary from the model during runtime? The ArtifactSerializer.create method
don't know which flag to use to restore a dictionary was serialized to the
model.
> Restore the abbreviation dictionary support in SentenceDetector
> ---------------------------------------------------------------
>
> Key: OPENNLP-225
> URL: https://issues.apache.org/jira/browse/OPENNLP-225
> Project: OpenNLP
> Issue Type: Improvement
> Components: Command Line Interface, Sentence Detector
> Affects Versions: tools-1.5.2-incubating
> Reporter: William Colen
> Assignee: William Colen
> Priority: Minor
> Fix For: tools-1.5.2-incubating
>
>
> Today the abbreviation dictionary features of SentenceDetector are only
> usable though the API. We should add mechanism to allow training with an
> abbreviation dictionary from command line, and also add the dictionary to the
> model as we do with POS Tagger.
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira