Given the following code: val japanese = "私はガラスを食べられます。それは私を傷つけません。" LanguageDetector.getDefaultLanguageDetector.loadModels().detectAll(japanese)
it produces [zh-CN: MEDIUM (0.579961), zh-TW: MEDIUM (0.405015)] And the same thing for many short Japanese sentences. Apache Tika 1.17
