[ 
https://issues.apache.org/jira/browse/OPENNLP-225?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13064783#comment-13064783
 ] 

William Colen edited comment on OPENNLP-225 at 7/13/11 7:16 PM:
----------------------------------------------------------------

Question: should we create an AbbreviationDictionary class that wraps the 
Dictionary to reuse the parse/serialize mechanism? As I understand an 
abbreviation dictionary is much simpler than our Dictionary implementation and 
maybe using the same mechanism should be overkilling.

Today the DefaultSDContextGenerator expects a Set<String> as 
abbreviationDictionary.

      was (Author: colen):
    Question: should we create an AbbreviationDictionary class that wraps the 
Dictionary to reuse the parse/serialize mechanism? As I understand an 
abbreviation dictionary is much simpler than our Dictionary implementation and 
maybe using the same mechanism should be overkilling.

Today teh DefaultSDContextGenerator expects a Set<String> as 
abbreviationDictionary.
  
> Restore the abbreviation dictionary support in SentenceDetector
> ---------------------------------------------------------------
>
>                 Key: OPENNLP-225
>                 URL: https://issues.apache.org/jira/browse/OPENNLP-225
>             Project: OpenNLP
>          Issue Type: Improvement
>          Components: Command Line Interface, Sentence Detector
>    Affects Versions: tools-1.5.2-incubating
>            Reporter: William Colen
>            Assignee: William Colen
>            Priority: Minor
>             Fix For: tools-1.5.2-incubating
>
>
> Today the abbreviation dictionary features of SentenceDetector are only 
> usable though the API. We should add mechanism to allow training with an 
> abbreviation dictionary from command line, and also add the dictionary to the 
> model as we do with POS Tagger.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to