Hi,

Just for the record: this can happen if a file contains content written in at least two different languages. For instance, the first half of the file might be in German and the second half in French. In such a case the detection would be faulty.
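To illustrate the point, here is a toy sketch in Java. It is not Tika's detector: the class MixedLanguageDemo, the stopword lists and both methods are made up for this mail, and a real language identifier works on n-gram profiles rather than stopwords. It only shows that a single label for the whole text hides the second language, while scoring sentence by sentence exposes it.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;

public class MixedLanguageDemo {
    // Tiny, hypothetical stopword lists (assumption for this sketch only).
    private static final Map<String, List<String>> STOPWORDS = new LinkedHashMap<>();
    static {
        STOPWORDS.put("de", Arrays.asList("der", "die", "das", "und", "ist", "nicht", "ein"));
        STOPWORDS.put("fr", Arrays.asList("le", "la", "les", "et", "est", "pas", "un"));
    }

    // Returns the language whose stopwords occur most often, or "unknown" if none match.
    public static String detect(String text) {
        String[] words = text.toLowerCase(Locale.ROOT).split("\\P{L}+");
        String best = "unknown";
        int bestScore = 0;
        for (Map.Entry<String, List<String>> entry : STOPWORDS.entrySet()) {
            int score = 0;
            for (String word : words) {
                if (entry.getValue().contains(word)) {
                    score++;
                }
            }
            if (score > bestScore) {
                bestScore = score;
                best = entry.getKey();
            }
        }
        return best;
    }

    // Splits the text after sentence-ending punctuation and labels each sentence separately.
    public static List<String> detectPerSentence(String text) {
        List<String> labels = new ArrayList<>();
        for (String sentence : text.split("(?<=[.!?])\\s+")) {
            labels.add(detect(sentence));
        }
        return labels;
    }

    public static void main(String[] args) {
        String german = "Der Hund ist nicht alt und die Katze ist ein Tier.";
        String french = "Le chat est petit et la maison n'est pas loin.";
        String mixed = german + " " + french;

        // One label for the whole text: the French half goes unnoticed.
        System.out.println("whole text:   " + detect(mixed));
        // One label per sentence: both languages show up.
        System.out.println("per sentence: " + detectPerSentence(mixed));
    }
}
```

With these two sample sentences the whole-text call reports only "de", whereas the per-sentence call reports "de" followed by "fr". Whether chunking like this is worth doing in a real detector is a separate question; short chunks give less evidence per decision.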
Br,
Oleg

On 3 Mar 2015 04:03, "Tyler Palsulich (JIRA)" <[email protected]> wrote:

>
> [
> https://issues.apache.org/jira/browse/TIKA-993?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
> ]
>
> Tyler Palsulich closed TIKA-993.
> --------------------------------
> Resolution: Cannot Reproduce
>
> This issue is >2 years old and has no attachment for the text. So, I'm
> closing as Cannot Reproduce. If you still have the text, please reopen!
>
> > Language Detection Fault
> > ------------------------
> >
> > Key: TIKA-993
> > URL: https://issues.apache.org/jira/browse/TIKA-993
> > Project: Tika
> > Issue Type: Bug
> > Components: languageidentifier
> > Reporter: Iman Reihanian
> > Attachments: DetectorImpl.java
> >
> >
> > This text's language is English but it detects as Italy.
>
>
>
> --
> This message was sent by Atlassian JIRA
> (v6.3.4#6332)
