Hello all, since Apache OpenNLP 1.8.1 we have a new language detection component which like all our components has to be trained. I think we should release a pre-build model for it trained on the Leipzig corpus. This will allow the majority of our users to get started very quickly with language detection without the need to figure out on how to train it.
How should this project release models? Jörn