French model for POS tagging with OpenNLP

2012-01-19 Thread Robert VISEUR
Hi, We are currently using OpenNLP for POS tagging tasks (on news articles). Some of the articles are in French, and I see there is no French POS tagging model in the standard OpenNLP package. Do you know of a public French model for POS tagging in OpenNLP? Thanks, Best regards, Robert.

Re: French model for POS tagging with OpenNLP

2012-01-19 Thread Jason Baldridge
Unfortunately, there is no data I'm aware of for training models for French. There are efforts underway to get multilingual annotations going on unrestricted texts, but they are still in the sandbox. Help with those would be welcome!

Re: French model for POS tagging with OpenNLP

2012-01-19 Thread Nicolas Hernandez
Hi Robert, We used (and still use) the French Treebank (Paris 7, Abeillé) to build machine learning models for (pre)processing French, some of them for OpenNLP. I say 'still use' because the French Treebank is not always consistent and we are trying "to correct it" in some way. About the rel

Re: French model for POS tagging with OpenNLP

2012-01-19 Thread Jason Baldridge
That's great to hear. I thought the French Treebank licensing was pretty clear about how artifacts trained on it could be used. Please keep us informed about the French data situation! FWIW, while I very much want to see the creation of unrestricted data with unrestricted annotations