Thank you Rodrigo. I've tried the lemmatizer data but I found it's not as simple as I hoped. It seems to require extra classes from IXA-PIPEs which is a 500 MB download.

$ echo 'Todo es amor.' | ~/opt/apache-opennlp-1.9.1/bin/opennlp LemmatizerME openNLP/data/es-lemma-perceptron-ancora-2.0.bin Loading Lemmatizer model ... Exception in thread "main" java.lang.IllegalArgumentException: opennlp.tools.util.InvalidFormatException: Could not instantiate the eus.ixa.ixa.pipe.lemma.LemmatizerFactory. The initialization throw an exception.
    at opennlp.tools.util.model.BaseModel.initializeFactory(BaseModel.java:259)
    at opennlp.tools.util.model.BaseModel.loadModel(BaseModel.java:234)
    at opennlp.tools.util.model.BaseModel.<init>(BaseModel.java:176)
    at opennlp.tools.lemmatizer.LemmatizerModel.<init>(LemmatizerModel.java:74)
    at opennlp.tools.cmdline.lemmatizer.LemmatizerModelLoader.loadModel(LemmatizerModelLoader.java:39)     at opennlp.tools.cmdline.lemmatizer.LemmatizerModelLoader.loadModel(LemmatizerModelLoader.java:31)
    at opennlp.tools.cmdline.ModelLoader.load(ModelLoader.java:56)
    at opennlp.tools.cmdline.lemmatizer.LemmatizerMETool.run(LemmatizerMETool.java:51)
    at opennlp.tools.cmdline.CLI.main(CLI.java:259)
Caused by: opennlp.tools.util.InvalidFormatException: Could not instantiate the eus.ixa.ixa.pipe.lemma.LemmatizerFactory. The initialization throw an exception.
    at opennlp.tools.util.BaseToolFactory.create(BaseToolFactory.java:116)
    at opennlp.tools.util.model.BaseModel.initializeFactory(BaseModel.java:257)
    ... 8 more
Caused by: opennlp.tools.util.ext.ExtensionNotLoadedException: Unable to find implementation for opennlp.tools.util.BaseToolFactory, the class or service *eus.ixa.ixa.pipe.lemma.LemmatizerFactory could not be located!*     at opennlp.tools.util.ext.ExtensionLoader.instantiateExtension(ExtensionLoader.java:119)
    at opennlp.tools.util.BaseToolFactory.create(BaseToolFactory.java:108)
    ... 9 more


On 7/10/19 7:15 AM, Rodrigo Agerri wrote:
You can also find an already trained lemmatizer (trained with general
news text) for Spanish here:

http://ixa2.si.ehu.es/ixa-pipes/

--
T. "Kuro" Kurosaka, Berkeley, California, USA

Reply via email to