[
https://issues.apache.org/jira/browse/OPENNLP-1270?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16862280#comment-16862280
]
Tim Allison commented on OPENNLP-1270:
--------------------------------------
Here's a link to the *-sentences.txt files from Leipzig for these additions:
[http://162.242.228.174/lang_detect/OPENNLP-1270_new_leipzig_langs.tgz]
Let me know if you'd like the model.
> Add new languages to the language detector
> ------------------------------------------
>
> Key: OPENNLP-1270
> URL: https://issues.apache.org/jira/browse/OPENNLP-1270
> Project: OpenNLP
> Issue Type: Task
> Reporter: Tim Allison
> Priority: Major
> Attachments: report.txt
>
>
> Leipzig has several other languages that might be useful to add to the
> language detector. I've selected some with > 10k sentences. Once I build
> the model and evaluate performance, I'll share the reports, the model and a
> tgz of the *-sentences.txt files.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)