Re: OpenNLP Name Finder training: Unsupported language: en

Xiang Ji Mon, 30 Jul 2018 13:07:35 -0700

The `eng` language code only applies to CoNLL-2003 related commands. I made a PR and a JIRA issue to update the documentation.
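
For reference, here is a rough sketch of the command from the original message with the language code switched to `eng`. It assumes the same file names as in that message (`model.bin`, `train.txt`) and that the `opennlp` launcher is on the PATH; the `lang_code` variable and the PATH check are only there for illustration. It has not been checked against every release.

```shell
# Language code expected by the CoNLL-2003 format in recent OpenNLP releases,
# per this thread (1.5.3 reportedly still worked with "en").
lang_code="eng"

# File names are the ones from the original message (assumed to exist locally).
if command -v opennlp >/dev/null 2>&1; then
  opennlp TokenNameFinderTrainer.conll03 -model model.bin -lang "$lang_code" \
    -types per,loc,org,misc -data train.txt -encoding UTF-8
else
  echo "opennlp not found on PATH" >&2
fi
```

Commands that do not read CoNLL-2003 data should not need this change, since the `eng` code only applies to the CoNLL-2003 related commands.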


On 30.07.2018 20:09, Xiang Ji wrote:

Apparently the language code was changed from `en` to `eng` in a later version... Why wasn't the documentation updated? (It still says "en".) I was banging my head for 2 hours trying to figure it out!
On 30.07.2018 20:05, Xiang Ji wrote:
OK, apparently using OpenNLP 1.5.3 works. Something is broken in the later versions. I'll try to file a bug report.
On 30.07.2018 20:00, Xiang Ji wrote:
This seems to have something to do with the CoNLL 2003 format. If I try to run the trainer directly it works. However, my input data is in CoNLL 2003 format. Running `TokenNameFinderConverter` gives me the same error. Even trying it on the official example https://github.com/apache/opennlp/blob/master/opennlp-tools/src/test/resources/opennlp/tools/formats/conll2003-en.sample doesn't work.
On 30.07.2018 18:33, Xiang Ji wrote:
Hi,
I’m trying to test training OpenNLP’s Name Finder on some data, according to the guide in the documentation. However, I encountered the error: |Unsupported language: en|, which doesn’t seem to make any sense.
The command I ran is: |opennlp TokenNameFinderTrainer.conll03 -model model.bin -lang en -types per,loc,org,misc -data train.txt -encoding UTF-8|
I downloaded OpenNLP 1.9.0 from https://opennlp.apache.org/download.html. The |OPENNLP_HOME| environment variable does seem to be properly set, and the |lang| folder in the base folder contains an |en| folder.
Best regards,

Xiang Ji
