Thank you Rodrigo. I've tried the lemmatizer data but I found it's not as simple
as I hoped. It seems to require extra classes from IXA-PIPEs which is a 500 MB
download.
$ echo 'Todo es amor.' | ~/opt/apache-opennlp-1.9.1/bin/opennlp LemmatizerME
openNLP/data/es-lemma-perceptron-ancora-2.0.bin
Loading Lemmatizer model ... Exception in thread "main"
java.lang.IllegalArgumentException: opennlp.tools.util.InvalidFormatException:
Could not instantiate the eus.ixa.ixa.pipe.lemma.LemmatizerFactory. The
initialization throw an exception.
at opennlp.tools.util.model.BaseModel.initializeFactory(BaseModel.java:259)
at opennlp.tools.util.model.BaseModel.loadModel(BaseModel.java:234)
at opennlp.tools.util.model.BaseModel.<init>(BaseModel.java:176)
at opennlp.tools.lemmatizer.LemmatizerModel.<init>(LemmatizerModel.java:74)
at
opennlp.tools.cmdline.lemmatizer.LemmatizerModelLoader.loadModel(LemmatizerModelLoader.java:39)
at
opennlp.tools.cmdline.lemmatizer.LemmatizerModelLoader.loadModel(LemmatizerModelLoader.java:31)
at opennlp.tools.cmdline.ModelLoader.load(ModelLoader.java:56)
at
opennlp.tools.cmdline.lemmatizer.LemmatizerMETool.run(LemmatizerMETool.java:51)
at opennlp.tools.cmdline.CLI.main(CLI.java:259)
Caused by: opennlp.tools.util.InvalidFormatException: Could not instantiate the
eus.ixa.ixa.pipe.lemma.LemmatizerFactory. The initialization throw an exception.
at opennlp.tools.util.BaseToolFactory.create(BaseToolFactory.java:116)
at opennlp.tools.util.model.BaseModel.initializeFactory(BaseModel.java:257)
... 8 more
Caused by: opennlp.tools.util.ext.ExtensionNotLoadedException: Unable to find
implementation for opennlp.tools.util.BaseToolFactory, the class or service
*eus.ixa.ixa.pipe.lemma.LemmatizerFactory could not be located!*
at
opennlp.tools.util.ext.ExtensionLoader.instantiateExtension(ExtensionLoader.java:119)
at opennlp.tools.util.BaseToolFactory.create(BaseToolFactory.java:108)
... 9 more
On 7/10/19 7:15 AM, Rodrigo Agerri wrote:
You can also find an already trained lemmatizer (trained with general
news text) for Spanish here:
http://ixa2.si.ehu.es/ixa-pipes/
--
T. "Kuro" Kurosaka, Berkeley, California, USA