[ 
https://issues.apache.org/jira/browse/OPENNLP-225?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13068783#comment-13068783
 ] 

William Colen commented on OPENNLP-225:
---------------------------------------

I commited the initial code for it, but there is one issue I could not figure 
out how to solve:
One can create a AbbreviationDictionary from a serialized file passing the 
stream and a case sensitivity flag. But how will it work while loading the 
dictionary from the model during runtime? The ArtifactSerializer.create method 
don't know which flag to use to restore a dictionary was serialized to the 
model.

> Restore the abbreviation dictionary support in SentenceDetector
> ---------------------------------------------------------------
>
>                 Key: OPENNLP-225
>                 URL: https://issues.apache.org/jira/browse/OPENNLP-225
>             Project: OpenNLP
>          Issue Type: Improvement
>          Components: Command Line Interface, Sentence Detector
>    Affects Versions: tools-1.5.2-incubating
>            Reporter: William Colen
>            Assignee: William Colen
>            Priority: Minor
>             Fix For: tools-1.5.2-incubating
>
>
> Today the abbreviation dictionary features of SentenceDetector are only 
> usable though the API. We should add mechanism to allow training with an 
> abbreviation dictionary from command line, and also add the dictionary to the 
> model as we do with POS Tagger.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to