Martin Wiesner created OPENNLP-1446:
---------------------------------------
Summary: Investigate why LeskEvaluatorTest fails while parsing
'EnglishLS.train'
Key: OPENNLP-1446
URL: https://issues.apache.org/jira/browse/OPENNLP-1446
Project: OpenNLP
Issue Type: Task
Components: wsd
Affects Versions: 2.1.0
Reporter: Martin Wiesner
The {{LeskEvaluatorTest}} in the _opennlp-wsd_ sandbox component fails while
parsing the 'EnglishLS.train' file. The data is kept in its original form, as
downloaded from
[https://web.eecs.umich.edu/~mihalcea/senseval/senseval3/data.html]
Aims:
* Investigate what causes the XML parsing to fail
* Fix it and make the existing test pass
* Optional: Improve the existing test code to be more strict.
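As a possible starting point for the first aim, here is a minimal sketch that runs a file through the JDK's built-in SAX parser and reports the position of the first well-formedness error. The class name {{XmlWellFormednessCheck}} and the method {{firstError}} are hypothetical and not part of opennlp-wsd, and the sketch assumes the failure is a plain XML well-formedness problem, which still has to be confirmed.

```java
import java.io.IOException;
import java.io.InputStream;
import java.nio.file.Files;
import java.nio.file.Paths;
import javax.xml.parsers.ParserConfigurationException;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.SAXException;
import org.xml.sax.SAXParseException;
import org.xml.sax.helpers.DefaultHandler;

// Hypothetical helper, not part of opennlp-wsd.
public class XmlWellFormednessCheck {

    /**
     * Parses the stream with the default JDK SAX parser.
     * Returns null if the input is well-formed, otherwise
     * "line:column: message" for the first fatal parse error.
     */
    static String firstError(InputStream in)
            throws IOException, ParserConfigurationException, SAXException {
        SAXParserFactory factory = SAXParserFactory.newInstance();
        try {
            // DefaultHandler ignores all content; its fatalError() rethrows the exception.
            factory.newSAXParser().parse(in, new DefaultHandler());
            return null;
        } catch (SAXParseException e) {
            return e.getLineNumber() + ":" + e.getColumnNumber() + ": " + e.getMessage();
        }
    }

    public static void main(String[] args) throws Exception {
        // args[0] is the path to the file to check, e.g. a local copy of EnglishLS.train.
        try (InputStream in = Files.newInputStream(Paths.get(args[0]))) {
            String error = firstError(in);
            System.out.println(error == null ? "well-formed" : error);
        }
    }
}
```

Running it with the path to a local copy of 'EnglishLS.train' as the only argument prints either "well-formed" or the location of the first error, which narrows down where the parser used by the test gives up.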
Note:
The test setup to reproduce this is on a branch that is still to be merged into
the main branch.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)