Hi,
We are actually using OpenNLP for POS tagging tasks (with news
articles). Part of the articles are in French, and I see there wasn't
french POS tagging model in the common OpenNLP package. Do you know a
French public model for POS tagging in Open NLP ?
Thanks,
Best regards,
Robert.
Unfortunately, there is no data I'm aware of for training models for
French. There are efforts underway to get multilingual annotations going on
unrestricted texts, but they are still in the sandbox. Help with those
would be welcome!
On Thu, Jan 19, 2012 at 10:27 AM, Robert VISEUR wrote:
> Hi,
>
Hi Robert
We used (and still use) the French Treebank (Paris 7 Abeille) for building
machine learning models for (pre)processing French and some of them for
OpenNLP.
I say 'still use' because the French Treebank is not always consistent and
we are trying "to correct it" in some way.
About the rel
That's great to hear. I thought the French Treebank licensing was pretty
clear about how artifacts that could be trained on it could be used. Please
keep us informed about the French data situation!
FWIW, while I very much want to see the creation of unrestricted data with
unrestricted annotations