scadams commented on PR #94: URL: https://github.com/apache/opennlp-site/pull/94#issuecomment-3342755333
@jzonthemtn @rzo1 Hopefully this answers your questions: The training data came from CoNLL 2006 and all of it can be downloaded here along with documentation and license information: [CoNLL-X Shared Task: Multi-lingual Dependency Parsing](https://web.archive.org/web/20070503133311/http://nextens.uvt.nl/~conll/free_data.html) These are/were the data sources for each language: - Danish: The Danish Dependency Treebank - Dutch: The Alpino Treebank - Portuguese: The Floresta Sintá(c)tica project - Swedish: Talbanken05 Swedish treebank This section of the wiki I mentioned above describes how the models were trained using this data: https://web.archive.org/web/20100917162145/http://sourceforge.net/apps/mediawiki/opennlp/index.php?title=Newlang#Language_Data -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
