[
https://issues.apache.org/jira/browse/OPENNLP-1384?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17775283#comment-17775283
]
ASF GitHub Bot commented on OPENNLP-1384:
-----------------------------------------
rzo1 commented on code in PR #553:
URL: https://github.com/apache/opennlp/pull/553#discussion_r1359518613
##########
opennlp-dl/src/test/java/opennlp/dl/doccat/DocumentCategorizerDLEval.java:
##########
@@ -92,6 +92,46 @@ public void categorize() throws IOException, OrtException {
}
+ @Test
+ public void categorizeWithAutomaticLabels() throws IOException, OrtException {
+
+ final File model = new File(getOpennlpDataDir(),
Review Comment:
For our evaluation tests in OpenNLP, we download the whole evaluation
dataset (~3 GB) from nightlies.apache.org on each run and use its content as
the `OPENNLP_DATA_DIR`, so it doesn't really matter ;-) I can imagine that
it might be harder to argue for in a project like Solr.
> Automatically generate document classifications map from model's config.json
> ----------------------------------------------------------------------------
>
> Key: OPENNLP-1384
> URL: https://issues.apache.org/jira/browse/OPENNLP-1384
> Project: OpenNLP
> Issue Type: Task
> Components: Deep Learning
> Affects Versions: 2.0.0
> Reporter: Jeff Zemerick
> Assignee: Jeff Zemerick
> Priority: Major
>
> Automatically generate classifications map from model's config.json.
> Currently, the implementations that use ONNX Runtime require a Map that
> stores each model-assigned value along with its human-readable name.
> This map must be created manually:
> Map<Integer, String> classifications = new HashMap<>();
> classifications.put(0, "negative");
> classifications.put(1, "positive");
> How to create this map is determined by looking at the model's config.json
> file. This task is to have OpenNLP read the config.json file and make the map
> automatically instead of requiring the user to make it manually.
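The idea in the description above could be sketched roughly as follows. This is only an illustration, not the code from the PR: it assumes the config.json follows the Hugging Face convention of a flat "id2label" object (for example {"0": "negative", "1": "positive"}), and the class name Id2LabelSketch and method parseId2Label are hypothetical. A regex is used only to keep the sketch free of dependencies; a real implementation would more likely use a JSON parser, and this version does not handle escaped quotes inside labels.

```java
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper, not part of OpenNLP: builds the classifications map
// from the text of a model's config.json.
final class Id2LabelSketch {

  // Assumption: the labels live in a flat object under the key "id2label".
  private static final Pattern ID2LABEL =
      Pattern.compile("\"id2label\"\\s*:\\s*\\{([^}]*)\\}");

  // One "<integer>": "<label>" entry inside that object.
  private static final Pattern ENTRY =
      Pattern.compile("\"(\\d+)\"\\s*:\\s*\"([^\"]*)\"");

  // Returns an empty map when no "id2label" object is found.
  static Map<Integer, String> parseId2Label(String configJson) {
    Map<Integer, String> classifications = new LinkedHashMap<>();
    Matcher block = ID2LABEL.matcher(configJson);
    if (block.find()) {
      Matcher entry = ENTRY.matcher(block.group(1));
      while (entry.find()) {
        classifications.put(Integer.parseInt(entry.group(1)), entry.group(2));
      }
    }
    return classifications;
  }

  public static void main(String[] args) {
    // Inline sample standing in for the contents of a config.json file.
    String config =
        "{\"id2label\": {\"0\": \"negative\", \"1\": \"positive\"}, \"model_type\": \"bert\"}";
    System.out.println(parseId2Label(config));
  }
}
```

With a map built this way, the manual classifications.put(...) calls shown in the description would no longer be needed, provided the model ships a config.json in the assumed layout.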
--
This message was sent by Atlassian Jira
(v8.20.10#820010)