[
https://issues.apache.org/jira/browse/NUTCH-1925?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14328194#comment-14328194
]
Tyler Palsulich commented on NUTCH-1925:
----------------------------------------
Thanks [~wastl-nagel]. Looking into it more,
org.apache.nutch.parse.tika.TikaConfig was deleted on the 1.x branch in
NUTCH-1234 (see [this
commit|https://github.com/apache/nutch/commit/7f44cdc998117eacc04609008fdac4ce1e2bb387#diff-a883bfa38ab4c09e2ee777564297367e])
in favor of org.apache.tika.config.TikaConfig. But, the same change was never
done on the 2.x branch. I can supply a patch that does it, but it will require
some API changes. That should fix the discrepancy we're seeing between 1.x and
2.x in this issue. Thoughts?
> Upgrade Tika to version 1.7
> ---------------------------
>
> Key: NUTCH-1925
> URL: https://issues.apache.org/jira/browse/NUTCH-1925
> Project: Nutch
> Issue Type: Improvement
> Components: build
> Reporter: Tyler Palsulich
> Assignee: Markus Jelsma
> Priority: Blocker
> Fix For: 1.10, 2.3.1
>
> Attachments: NUTCH-1925-2x.patch, NUTCH-1925.palsulich.p2.patch,
> NUTCH-1925.palsulich.patch, NUTCH-1925.palsulich.v2.patch
>
>
> Hi Folks. Nutch currently uses version 1.6 of Tika. There were no significant
> API changes between 1.6 and 1.7. So, this should be a one line update.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)