[
https://issues.apache.org/jira/browse/NUTCH-2959?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17725839#comment-17725839
]
Sebastian Nagel commented on NUTCH-2959:
----------------------------------------
Hi [~tallison], if running in local mode it might be a good option to delegate
the parsing to a separate process. When running on a Hadoop cluster, it might
cause some headaches to get the process running on the task nodes.
> Upgrade to Apache Tika 2.4.1
> ----------------------------
>
> Key: NUTCH-2959
> URL: https://issues.apache.org/jira/browse/NUTCH-2959
> Project: Nutch
> Issue Type: Task
> Affects Versions: 1.19
> Reporter: Markus Jelsma
> Priority: Major
> Fix For: 1.20
>
> Attachments: NUTCH-2959.patch
>
>
--
This message was sent by Atlassian Jira
(v8.20.10#820010)