It returns the first language found, which in this case will be German. The expected result is a topic for discussion. I would expect to get both languages detected. However, that is beyond Tika's language detection.
Bottom line: let it stay as is until Ken's implementation.

On 3 Mar 2015 09:09, "Tyler Palsulich" <[email protected]> wrote:

> Hi,
>
> What do you mean, the detection is faulty? What is the expected result in
> that case?
>
> Thanks,
> Tyler
> On Mar 3, 2015 1:10 AM, "Oleg Tikhonov" <[email protected]> wrote:
>
> > Hi,
> > Just for the record ...
> > It can happen if a file contains context that at least written in two
> > different languages. For instance, the first half of file, say, is a
> > German and the second one, say ... a French. In such case detection
> > would be faulty.
> >
> > Br,
> > Oleg
> > On 3 Mar 2015 04:03, "Tyler Palsulich (JIRA)" <[email protected]> wrote:
> >
> > > [
> > > https://issues.apache.org/jira/browse/TIKA-993?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
> > > ]
> > >
> > > Tyler Palsulich closed TIKA-993.
> > > --------------------------------
> > > Resolution: Cannot Reproduce
> > >
> > > This issue is >2 years old and has no attachment for the text. So, I'm
> > > closing as Cannot Reproduce. If you still have the text, please reopen!
> > >
> > > > Language Detection Fault
> > > > ------------------------
> > > >
> > > > Key: TIKA-993
> > > > URL: https://issues.apache.org/jira/browse/TIKA-993
> > > > Project: Tika
> > > > Issue Type: Bug
> > > > Components: languageidentifier
> > > > Reporter: Iman Reihanian
> > > > Attachments: DetectorImpl.java
> > > >
> > > >
> > > > This text's language is English but it detects as Italy.
> > >
> > >
> > > --
> > > This message was sent by Atlassian JIRA
> > > (v6.3.4#6332)
