Hello

I have tried to train a Parser model from the data provided by http://universaldependencies.org/ in the conllu format,
but I failed.

The documentation for the training process (with the Java API) is lacking (how do you read from conllu files?), and it looks like the command line utility expects the data to be in the OpenNLP format (what is that? Is there a utility that can convert from the conllu format to the OpenNLP format?).

It is really frustrating not to be able to train a model (I just spent days trying), especially since using a pre-trained model is a breeze and makes you believe that parsing is right at your fingertips (if you manage to create an ObjectStream<Parse> from a conllu file, that is... which I miserably failed to do).

Can someone please help by providing a working sample of training a Parser from a conllu file, or of creating an ObjectStream<Parse> from a conllu treebank (like Universal Dependencies)?

(By the way, the sample provided in the OpenNLP documentation (http://opennlp.apache.org/docs/1.8.4/manual/opennlp.html#tools.parser.parsing.api) is somewhat incomplete: it doesn't tell how to create mlParameters, etc.)
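
For reference, the conllu reading step on its own can be sketched in plain Java, without any OpenNLP class. This is only a minimal sketch under assumptions: ConlluReader and Token are hypothetical names, not part of OpenNLP, and the column positions follow the ten-column CoNLL-U layout (ID, FORM, LEMMA, UPOS, XPOS, FEATS, HEAD, DEPREL, DEPS, MISC). It does not build an ObjectStream<Parse>; it only shows what data is available once the file is read.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Paths;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical helper, NOT part of OpenNLP: reads CoNLL-U lines into plain Java objects.
class ConlluReader {

    // One word line of a sentence; only five of the ten columns are kept here.
    static final class Token {
        final String id;      // column 1: ID
        final String form;    // column 2: FORM
        final String upos;    // column 4: UPOS
        final String head;    // column 7: HEAD (ID of the governing word, 0 for the root)
        final String deprel;  // column 8: DEPREL

        Token(String id, String form, String upos, String head, String deprel) {
            this.id = id;
            this.form = form;
            this.upos = upos;
            this.head = head;
            this.deprel = deprel;
        }
    }

    // Groups word lines into sentences: a blank line ends a sentence, '#' starts a comment.
    static List<List<Token>> read(List<String> lines) {
        List<List<Token>> sentences = new ArrayList<>();
        List<Token> current = new ArrayList<>();
        for (String line : lines) {
            if (line.trim().isEmpty()) {
                if (!current.isEmpty()) {
                    sentences.add(current);
                    current = new ArrayList<>();
                }
                continue;
            }
            if (line.startsWith("#")) {
                continue; // sentence-level comment such as "# text = ..."
            }
            String[] cols = line.split("\t", -1);
            if (cols.length != 10) {
                throw new IllegalArgumentException("Expected 10 columns, got " + cols.length + ": " + line);
            }
            // Skip multiword token ranges (e.g. "1-2") and empty nodes (e.g. "1.1").
            if (cols[0].contains("-") || cols[0].contains(".")) {
                continue;
            }
            current.add(new Token(cols[0], cols[1], cols[3], cols[6], cols[7]));
        }
        if (!current.isEmpty()) {
            sentences.add(current); // file did not end with a blank line
        }
        return sentences;
    }

    public static void main(String[] args) throws IOException {
        // Reads the file given as first argument, or a tiny inline sample otherwise.
        List<String> lines = args.length > 0
                ? Files.readAllLines(Paths.get(args[0]), StandardCharsets.UTF_8)
                : Arrays.asList(
                        "# text = Hello world",
                        "1\tHello\thello\tINTJ\t_\t_\t0\troot\t_\t_",
                        "2\tworld\tworld\tNOUN\t_\t_\t1\tvocative\t_\t_",
                        "");
        for (List<Token> sentence : read(lines)) {
            for (Token t : sentence) {
                System.out.println(t.id + "\t" + t.form + "\t" + t.upos + "\t" + t.head + "\t" + t.deprel);
            }
            System.out.println();
        }
    }
}
```

The rows hold dependency annotation (a head index and a relation per word), so turning them into Parse objects would still need a separate conversion step that this sketch does not attempt.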

best regards,
Olivier



