2011/2/14 Aingaran Pillai <[email protected]>: > Hi, > > Is there any support planned to support Entity Extraction in other languages? > E.g. French, German, etc.
Yes it is planned. There is some cooperation underway with the upstream OpenNLP project to build new statistical language model from various free to redistribute corpora. I have also started some proof of concept tools: http://blogs.nuxeo.com/dev/2011/01/mining-wikipedia-with-hadoop-and-pig-for-natural-language-processing.html On the Stanbol side, we need to upgrade to OpenNLP 1.5 asap and un-hard-code the model loading: https://issues.apache.org/jira/browse/STANBOL-13 -- Olivier http://twitter.com/ogrisel - http://github.com/ogrisel
