[ https://issues.apache.org/jira/browse/OPENNLP-81?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13966070#comment-13966070 ]
William Colen edited comment on OPENNLP-81 at 4/11/14 12:36 AM:
----------------------------------------------------------------
Created the evaluation CLI:
$ bin/opennlp DoccatEvaluator
Usage: opennlp DoccatEvaluator[.leipzig] [-reportOutputFile outputFile]
       [-misclassified true|false] -model model -data sampleData
       [-encoding charsetName]

Arguments description:
    -reportOutputFile outputFile
        the path of the fine-grained report file.
    -misclassified true|false
        if true will print false negatives and false positives.
    -model model
        the model file to be evaluated.
    -data sampleData
        data to be used, usually a file name.
    -encoding charsetName
        encoding for reading and writing text, if absent the system
        default is used.

The reportOutputFile includes F-Measure for each category and a confusion
matrix.
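
To illustrate what a per-category F-Measure and a confusion matrix contain, here is a minimal, self-contained Java sketch. It is not the code behind DoccatEvaluator and does not use the OpenNLP API. The class name DoccatReportSketch, its method names, and the sample categories are all hypothetical and chosen only for this example. It assumes the evaluation yields one reference category and one predicted category per document.

```java
import java.util.Map;
import java.util.TreeMap;

// Hypothetical illustration only: not part of OpenNLP.
public class DoccatReportSketch {

    /** Builds counts indexed as matrix[reference category][predicted category]. */
    public static Map<String, Map<String, Integer>> confusionMatrix(
            String[] reference, String[] predicted) {
        if (reference.length != predicted.length) {
            throw new IllegalArgumentException("reference and predicted must have equal length");
        }
        Map<String, Map<String, Integer>> matrix = new TreeMap<>();
        for (int i = 0; i < reference.length; i++) {
            matrix.computeIfAbsent(reference[i], k -> new TreeMap<>())
                  .merge(predicted[i], 1, Integer::sum);
        }
        return matrix;
    }

    /** F-Measure (F1) for one category: 2*TP / (2*TP + FP + FN). */
    public static double fMeasure(Map<String, Map<String, Integer>> matrix, String category) {
        int tp = 0, fp = 0, fn = 0;
        for (Map.Entry<String, Map<String, Integer>> row : matrix.entrySet()) {
            boolean referenceMatches = row.getKey().equals(category);
            for (Map.Entry<String, Integer> cell : row.getValue().entrySet()) {
                boolean predictedMatches = cell.getKey().equals(category);
                if (referenceMatches && predictedMatches) {
                    tp += cell.getValue();      // correctly assigned to the category
                } else if (predictedMatches) {
                    fp += cell.getValue();      // false positive
                } else if (referenceMatches) {
                    fn += cell.getValue();      // false negative
                }
            }
        }
        int denominator = 2 * tp + fp + fn;
        return denominator == 0 ? 0.0 : (2.0 * tp) / denominator;
    }

    public static void main(String[] args) {
        // Made-up sample data, for illustration only.
        String[] reference = {"sports", "sports", "politics", "politics", "tech"};
        String[] predicted = {"sports", "politics", "politics", "politics", "tech"};

        Map<String, Map<String, Integer>> matrix = confusionMatrix(reference, predicted);
        System.out.println("Confusion matrix (reference -> predicted counts): " + matrix);
        for (String category : matrix.keySet()) {
            System.out.printf("F-Measure for %s: %.4f%n", category, fMeasure(matrix, category));
        }
    }
}
```

The off-diagonal cells of the matrix are the false negatives and false positives that -misclassified true is described as printing. The layout of the actual report file may differ from this sketch.
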
was (Author: colen):
Created the evaluation CLI:
$ bin/opennlp DoccatEvaluator
Usage: opennlp DoccatEvaluator[.leipzig] [-reportOutputFile outputFile]
       [-misclassified true|false] -model model [-detailedF true|false]
       -data sampleData [-encoding charsetName]

Arguments description:
    -reportOutputFile outputFile
        the path of the fine-grained report file.
    -misclassified true|false
        if true will print false negatives and false positives.
    -model model
        the model file to be evaluated.
    -data sampleData
        data to be used, usually a file name.
    -encoding charsetName
        encoding for reading and writing text, if absent the system
        default is used.

The reportOutputFile includes F-Measure for each category and a confusion
matrix.
> Add a cli tool for the doccat evaluation support
> ------------------------------------------------
>
> Key: OPENNLP-81
> URL: https://issues.apache.org/jira/browse/OPENNLP-81
> Project: OpenNLP
> Issue Type: Improvement
> Components: Command Line Interface, Doccat
> Reporter: Joern Kottmann
> Assignee: William Colen
> Priority: Minor
> Fix For: 1.6.0
>
>
> There should be a command line tool which can be used to evaluate the
> document categorizer model on a test file.