[ https://issues.apache.org/jira/browse/TIKA-1622?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14542386#comment-14542386 ]
Chris A. Mattmann commented on TIKA-1622:
-----------------------------------------
Hi [~tledoux], I tried the patch out and, for whatever reason, Tika's
language identifier detects the corrected French sentence as Italian, so the
unit test fails. Sigh. FYI, here is the test output:
{noformat}
-------------------------------------------------------
T E S T S
-------------------------------------------------------
Running org.apache.tika.server.DetectorResourceTest
Tests run: 2, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 1.774 sec - in org.apache.tika.server.DetectorResourceTest
Running org.apache.tika.server.LanguageResourceTest
Tests run: 4, Failures: 2, Errors: 0, Skipped: 0, Time elapsed: 0.277 sec <<< FAILURE! - in org.apache.tika.server.LanguageResourceTest
testDetectFrenchString(org.apache.tika.server.LanguageResourceTest) Time elapsed: 0.048 sec <<< FAILURE!
org.junit.ComparisonFailure: expected:<[fr]> but was:<[it]>
    at org.junit.Assert.assertEquals(Assert.java:115)
    at org.junit.Assert.assertEquals(Assert.java:144)
    at org.apache.tika.server.LanguageResourceTest.testDetectFrenchString(LanguageResourceTest.java:82)
testDetectFrenchFile(org.apache.tika.server.LanguageResourceTest) Time elapsed: 0.031 sec <<< FAILURE!
org.junit.ComparisonFailure: expected:<[fr]> but was:<[it]>
    at org.junit.Assert.assertEquals(Assert.java:115)
    at org.junit.Assert.assertEquals(Assert.java:144)
    at org.apache.tika.server.LanguageResourceTest.testDetectFrenchFile(LanguageResourceTest.java:106)
{noformat}
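For anyone wondering how a French sentence can come back as "it": as far as I understand it, the identifier compares character n-gram statistics of the input against per-language profiles and picks the closest one. A short sentence yields only a handful of n-grams, so closely related languages can end up nearly tied. The toy sketch below illustrates that idea only. It is not Tika's implementation; the class name TrigramSketch, its methods, and the one-sentence "profiles" are all made up for illustration.

```java
import java.util.HashMap;
import java.util.HashSet;
import java.util.Map;
import java.util.Set;

// Hypothetical toy classifier: NOT Tika's code, just the general n-gram idea.
class TrigramSketch {

    // Count overlapping character trigrams in the lower-cased text.
    static Map<String, Integer> profile(String text) {
        Map<String, Integer> counts = new HashMap<>();
        String s = text.toLowerCase();
        for (int i = 0; i + 3 <= s.length(); i++) {
            counts.merge(s.substring(i, i + 3), 1, Integer::sum);
        }
        return counts;
    }

    // Total number of trigram occurrences in a profile.
    static int total(Map<String, Integer> p) {
        int sum = 0;
        for (int c : p.values()) {
            sum += c;
        }
        return sum;
    }

    // Euclidean distance between the relative trigram frequencies of two profiles.
    static double distance(Map<String, Integer> a, Map<String, Integer> b) {
        double ta = Math.max(1, total(a));
        double tb = Math.max(1, total(b));
        Set<String> keys = new HashSet<>(a.keySet());
        keys.addAll(b.keySet());
        double sum = 0.0;
        for (String k : keys) {
            double d = a.getOrDefault(k, 0) / ta - b.getOrDefault(k, 0) / tb;
            sum += d * d;
        }
        return Math.sqrt(sum);
    }

    // Return the key of the sample text whose profile is closest to the input.
    static String closest(String text, Map<String, String> samples) {
        Map<String, Integer> p = profile(text);
        String best = null;
        double bestDist = Double.MAX_VALUE;
        for (Map.Entry<String, String> e : samples.entrySet()) {
            double d = distance(p, profile(e.getValue()));
            if (d < bestDist) {
                bestDist = d;
                best = e.getKey();
            }
        }
        return best;
    }

    public static void main(String[] args) {
        // Made-up one-sentence "profiles"; real profiles are built from large corpora.
        Map<String, String> samples = new HashMap<>();
        samples.put("fr", "bonjour le monde");
        samples.put("it", "buongiorno a tutti");
        System.out.println(closest("bonjour le monde", samples));
    }
}
```

With the real library, the equivalent check would be new LanguageIdentifier(text).getLanguage() from org.apache.tika.language, and isReasonablyCertain() on the same object reports whether the identifier trusts its own answer, which may be worth logging for the failing sentence.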
> Expose Tika LanguageIdentifier via Tika Server
> ----------------------------------------------
>
> Key: TIKA-1622
> URL: https://issues.apache.org/jira/browse/TIKA-1622
> Project: Tika
> Issue Type: Bug
> Components: languageidentifier, server
> Reporter: Chris A. Mattmann
> Assignee: Chris A. Mattmann
> Fix For: 1.9
>
> Attachments: TIKA-1622-commeci.patch
>
>
> The LanguageIdentifier in Tika should be exposed via Tika JAX-RS.