[ 
https://issues.apache.org/jira/browse/OPENNLP-81?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13966070#comment-13966070
 ] 

William Colen edited comment on OPENNLP-81 at 4/11/14 12:36 AM:
----------------------------------------------------------------

Created the evaluation CLI:

$ bin/opennlp DoccatEvaluator
Usage: opennlp DoccatEvaluator[.leipzig] [-reportOutputFile outputFile] 
[-misclassified true|false] -model model -data sampleData [-encoding 
charsetName]

Arguments description:
        -reportOutputFile outputFile
                the path of the fine-grained report file.
        -misclassified true|false
                if true will print false negatives and false positives.
        -model model
                the model file to be evaluated.
        -data sampleData
                data to be used, usually a file name.
        -encoding charsetName
                encoding for reading and writing text, if absent the system 
default is used.

The reportOutputFile includes F-Measure for each category and a confusion 
matrix.


was (Author: colen):
Created the evaluation CLI:

$ bin/opennlp DoccatEvaluator
Usage: opennlp DoccatEvaluator[.leipzig] [-reportOutputFile outputFile] 
[-misclassified true|false] -model model [-detailedF true|false] -data 
sampleData [-encoding charsetName]

Arguments description:
        -reportOutputFile outputFile
                the path of the fine-grained report file.
        -misclassified true|false
                if true will print false negatives and false positives.
        -model model
                the model file to be evaluated.
        -data sampleData
                data to be used, usually a file name.
        -encoding charsetName
                encoding for reading and writing text, if absent the system 
default is used.

The reportOutputFile includes F-Measure for each category and a confusion 
matrix.

> Add a cli tool for the doccat evaluation support
> ------------------------------------------------
>
>                 Key: OPENNLP-81
>                 URL: https://issues.apache.org/jira/browse/OPENNLP-81
>             Project: OpenNLP
>          Issue Type: Improvement
>          Components: Command Line Interface, Doccat
>            Reporter: Joern Kottmann
>            Assignee: William Colen
>            Priority: Minor
>             Fix For: 1.6.0
>
>
> There should be a command line tool which can be used to evaluate the 
> document categorizer model
> on a test file.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to