[
https://issues.apache.org/jira/browse/NUTCH-2158?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Julien Nioche updated NUTCH-2158:
---------------------------------
Attachment: NUTCH-2158.patch
Patch which upgrades to Tika 1.11
tests fail for protocol-http
{code}
Testcase: testStatusCode took 3.648 sec
FAILED
ContentType http://127.0.0.1:47504/basic-http.jsp
expected:<[application/xhtml+x]ml> but was:<[text/ht]ml>
junit.framework.AssertionFailedError: ContentType
http://127.0.0.1:47504/basic-http.jsp expected:<[application/xhtml+x]ml> but
was:<[text/ht]ml>
at
org.apache.nutch.protocol.http.TestProtocolHttp.fetchPage(TestProtocolHttp.java:136)
at
org.apache.nutch.protocol.http.TestProtocolHttp.testStatusCode(TestProtocolHttp.java:80)
{code}
mimetype detected is different but probably correct. Will fix later.
> Upgrade to Tika 1.11
> --------------------
>
> Key: NUTCH-2158
> URL: https://issues.apache.org/jira/browse/NUTCH-2158
> Project: Nutch
> Issue Type: Task
> Components: parser
> Reporter: Chris A. Mattmann
> Assignee: Julien Nioche
> Fix For: 1.11
>
> Attachments: NUTCH-2158.patch
>
>
> Upgrade parse-tika to 1.11 release for Tika.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)