[
https://issues.apache.org/jira/browse/OPENNLP-1270?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17312439#comment-17312439
]
Tim Allison commented on OPENNLP-1270:
--
{noformat}
Adding (bin)
[
https://issues.apache.org/jira/browse/OPENNLP-1270?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17311783#comment-17311783
]
Arky commented on OPENNLP-1270:
---
[~tallison] Okell, John corpus is available here
[
https://issues.apache.org/jira/browse/OPENNLP-1270?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17311718#comment-17311718
]
Tim Allison commented on OPENNLP-1270:
--
I haven't heard any objections to adding leipzig data to
[
https://issues.apache.org/jira/browse/OPENNLP-1270?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17310946#comment-17310946
]
Tim Allison commented on OPENNLP-1270:
--
We recently got a request on TIKA-3340 to add detection of
[
https://issues.apache.org/jira/browse/OPENNLP-1270?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17023639#comment-17023639
]
Suneel Marthi commented on OPENNLP-1270:
Could we also look at Europaarl corpus maybe?
[
https://issues.apache.org/jira/browse/OPENNLP-1270?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16867041#comment-16867041
]
Tim Allison commented on OPENNLP-1270:
--
I updated the tgz on the link above to include a large
[
https://issues.apache.org/jira/browse/OPENNLP-1270?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16862283#comment-16862283
]
Tim Allison commented on OPENNLP-1270:
--
Performance on the current languages doesn't change