On 06/19/2012 04:47 PM, Mariya Koleva wrote:
And yes, what I'm ultimately planning to do is to train POS models for Zulu
and other related languages, and hopefully have them out for the community.
The POS data on this website is already in the OpenNLP format.
It should work if you follow the instructions here:
http://opennlp.apache.org/documentation/1.5.2-incubating/manual/opennlp.html#tools.postagger.training
You can use the command which is given there and just pass in the data
from the website,
you might need to set a different encoding.
Jörn