[ 
https://issues.apache.org/jira/browse/OPENNLP-231?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13089363#comment-13089363
 ] 

Joern Kottmann commented on OPENNLP-231:
----------------------------------------

You might want to create the ngram dictionary on a much larger text corpus, 
instead of the training data. Now its both possible, if I remember correctly 
Tom told me that didn't work well, and was more like an experiment, maybe we 
should validate this statement, and if it turn out to be true, it should be 
removed one day.

> POS Tagger cross validator tool is not evaluating models that includes ngram 
> dictionaries.
> ------------------------------------------------------------------------------------------
>
>                 Key: OPENNLP-231
>                 URL: https://issues.apache.org/jira/browse/OPENNLP-231
>             Project: OpenNLP
>          Issue Type: Improvement
>          Components: Command Line Interface, POS Tagger
>    Affects Versions: tools-1.5.2-incubating
>            Reporter: William Colen
>            Assignee: William Colen
>            Priority: Minor
>             Fix For: tools-1.5.2-incubating
>
>
> The parameter -ngram is present on POS Tagger trainer tool, but it is not 
> present on CV tool.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to