[
https://issues.apache.org/jira/browse/TIKA-3456?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17370878#comment-17370878
]
Hudson commented on TIKA-3456:
------------------------------
SUCCESS: Integrated in Jenkins build Tika ยป tika-main-jdk8 #263 (See
[https://ci-builds.apache.org/job/Tika/job/tika-main-jdk8/263/])
TIKA-3456 -- LanguageDetector should try to respect "hasEnoughText" if the user
adds a large string (tallison:
[https://github.com/apache/tika/commit/b5e78ac80f290069aa0e1fe084421b66bf1e8c5f])
* (edit)
tika-core/src/main/java/org/apache/tika/language/detect/LanguageDetector.java
> LanguageDetector should try to respect hasEnoughText more intelligently
> -----------------------------------------------------------------------
>
> Key: TIKA-3456
> URL: https://issues.apache.org/jira/browse/TIKA-3456
> Project: Tika
> Issue Type: Task
> Reporter: Tim Allison
> Priority: Minor
> Fix For: 1.27
>
>
> If a user calls LanguageDetector's detect(String txt) or addText(String txt),
> the full string is passed on to the subclasses and there is no check on
> "hasEnoughText()". For large strings, LanguageDetector should break the
> string into smaller parts and check for hasEnoughText().
--
This message was sent by Atlassian Jira
(v8.3.4#803005)