Freeling will do everything you need in Spanish and more: http://nlp.lsi.upc.edu/freeling/index.php/node/4
jds On Wed, Jul 10, 2019 at 7:17 PM T. Kuro Kurosaka <k...@bhlab.com> wrote: > Thank you Rodrigo. I've tried the lemmatizer data but I found it's not as > simple > as I hoped. It seems to require extra classes from IXA-PIPEs which is a > 500 MB > download. > > $ echo 'Todo es amor.' | ~/opt/apache-opennlp-1.9.1/bin/opennlp > LemmatizerME > openNLP/data/es-lemma-perceptron-ancora-2.0.bin > Loading Lemmatizer model ... Exception in thread "main" > java.lang.IllegalArgumentException: > opennlp.tools.util.InvalidFormatException: > Could not instantiate the eus.ixa.ixa.pipe.lemma.LemmatizerFactory. The > initialization throw an exception. > at > opennlp.tools.util.model.BaseModel.initializeFactory(BaseModel.java:259) > at opennlp.tools.util.model.BaseModel.loadModel(BaseModel.java:234) > at opennlp.tools.util.model.BaseModel.<init>(BaseModel.java:176) > at > opennlp.tools.lemmatizer.LemmatizerModel.<init>(LemmatizerModel.java:74) > at > > opennlp.tools.cmdline.lemmatizer.LemmatizerModelLoader.loadModel(LemmatizerModelLoader.java:39) > at > > opennlp.tools.cmdline.lemmatizer.LemmatizerModelLoader.loadModel(LemmatizerModelLoader.java:31) > at opennlp.tools.cmdline.ModelLoader.load(ModelLoader.java:56) > at > > opennlp.tools.cmdline.lemmatizer.LemmatizerMETool.run(LemmatizerMETool.java:51) > at opennlp.tools.cmdline.CLI.main(CLI.java:259) > Caused by: opennlp.tools.util.InvalidFormatException: Could not > instantiate the > eus.ixa.ixa.pipe.lemma.LemmatizerFactory. The initialization throw an > exception. > at opennlp.tools.util.BaseToolFactory.create(BaseToolFactory.java:116) > at > opennlp.tools.util.model.BaseModel.initializeFactory(BaseModel.java:257) > ... 8 more > Caused by: opennlp.tools.util.ext.ExtensionNotLoadedException: Unable to > find > implementation for opennlp.tools.util.BaseToolFactory, the class or > service > *eus.ixa.ixa.pipe.lemma.LemmatizerFactory could not be located!* > at > > opennlp.tools.util.ext.ExtensionLoader.instantiateExtension(ExtensionLoader.java:119) > at opennlp.tools.util.BaseToolFactory.create(BaseToolFactory.java:108) > ... 9 more > > > On 7/10/19 7:15 AM, Rodrigo Agerri wrote: > > You can also find an already trained lemmatizer (trained with general > > news text) for Spanish here: > > > > http://ixa2.si.ehu.es/ixa-pipes/ > > -- > T. "Kuro" Kurosaka, Berkeley, California, USA > >