[
https://issues.apache.org/jira/browse/NUTCH-2665?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16661983#comment-16661983
]
Markus Jelsma commented on NUTCH-2665:
--------------------------------------
Hello [~axr], yes, it compiles fine; that is what the default.properties patch
is for.
Running tests:
{code}
ContentType http://127.0.0.1:47501/basic-http.jsp expected:<[application/xhtml+x]ml> but was:<[text/ht]ml>
junit.framework.AssertionFailedError: ContentType http://127.0.0.1:47501/basic-http.jsp expected:<[application/xhtml+x]ml> but was:<[text/ht]ml>
at org.apache.nutch.protocol.http.TestProtocolHttp.fetchPage(TestProtocolHttp.java:134)
at org.apache.nutch.protocol.http.TestProtocolHttp.testStatusCode(TestProtocolHttp.java:79)
{code}
This test fails, but I am actually fine with this response. I propose changing
the test to assert text/html instead of application/xhtml+xml. Opinions?
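As a rough sketch of the proposed change, the expectation would simply switch to text/html. Everything below is a hypothetical stand-in (the class and method names are assumptions, not the actual TestProtocolHttp code); it only mirrors the shape of the failing assertion.

```java
// Hypothetical stand-in for the content-type check in the failing test.
// None of these names come from the Nutch code base.
class ContentTypeCheck {

    // Throws an AssertionError with a JUnit-like message when the types differ.
    static void assertContentType(String url, String expected, String actual) {
        if (!expected.equals(actual)) {
            throw new AssertionError("ContentType " + url
                + " expected:<" + expected + "> but was:<" + actual + ">");
        }
    }

    public static void main(String[] args) {
        String url = "http://127.0.0.1:47501/basic-http.jsp";
        // Content type reported in the test run above.
        String actual = "text/html";
        // Proposed expectation: text/html instead of application/xhtml+xml.
        assertContentType(url, "text/html", actual);
        System.out.println("ContentType check passed for " + url);
    }
}
```

With the old expectation (application/xhtml+xml) the same check would throw, which matches the failure in the test output.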
> Upgrade to Apache Tika 1.19.1
> -----------------------------
>
> Key: NUTCH-2665
> URL: https://issues.apache.org/jira/browse/NUTCH-2665
> Project: Nutch
> Issue Type: Task
> Components: parser
> Affects Versions: 2.3.1
> Reporter: Markus Jelsma
> Assignee: Markus Jelsma
> Priority: Major
> Fix For: 2.4
>
> Attachments: NUTCH-2665.patch, NUTCH-2665.patch
>
>
> Borrowing from [~wastl-nagel]'s efforts on NUTCH-2651, 2.x can be upgraded to
> Apache Tika 1.19.1 as well.