[
https://issues.apache.org/jira/browse/OPENNLP-788?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16049299#comment-16049299
]
ASF GitHub Bot commented on OPENNLP-788:
----------------------------------------
GitHub user wcolen opened a pull request:
https://github.com/apache/opennlp/pull/230
OPENNLP-788: Add LanguageDetector tool
Thank you for contributing to Apache OpenNLP.
In order to streamline the review of the contribution we ask you
to ensure the following steps have been taken:
### For all changes:
- [X] Is there a JIRA ticket associated with this PR? Is it referenced
in the commit message?
- [X] Does your PR title start with OPENNLP-XXXX where XXXX is the JIRA
number you are trying to resolve? Pay particular attention to the hyphen "-"
character.
- [X] Has your PR been rebased against the latest commit within the target
branch (typically master)?
- [X] Is your initial contribution a single, squashed commit?
### For code changes:
- [X] Have you ensured that the full suite of tests is executed via mvn
clean install at the root opennlp folder?
- [X] Have you written or updated unit tests to verify your changes?
- [ ] If adding new dependencies to the code, are these dependencies
licensed in a way that is compatible for inclusion under [ASF
2.0](http://www.apache.org/legal/resolved.html#category-a)?
- [ ] If applicable, have you updated the LICENSE file, including the main
LICENSE file in opennlp folder?
- [ ] If applicable, have you updated the NOTICE file, including the main
NOTICE file found in opennlp folder?
### For documentation related changes:
- [ ] Have you ensured that format looks appropriate for the output in
which it is rendered?
### Note:
Please ensure that once the PR is submitted, you check travis-ci for build
issues and submit an update to your PR as soon as possible.
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/apache/opennlp LangDetect
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/opennlp/pull/230.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #230
----
commit 6b689681c12c162c688081cf3e358e9e082b08d7
Author: William D C M SILVA <[email protected]>
Date: 2017-05-17T16:34:21Z
OPENNLP-788: Add LanguageDetector tool
----
> Add a language detection component
> ----------------------------------
>
> Key: OPENNLP-788
> URL: https://issues.apache.org/jira/browse/OPENNLP-788
> Project: OpenNLP
> Issue Type: Improvement
> Reporter: Joern Kottmann
> Labels: help-wanted
>
> Many of the components in OpenNLP are sensitive to the input language. It
> would be nice if OpenNLP would have a component to detect the language of an
> input text.
> Two commonly used solutions today are:
> Apache Tikas Language Identifier
> Language Detection from Shuyo, Nakatani
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)