[ https://issues.apache.org/jira/browse/NUTCH-2959?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17725839#comment-17725839 ]
Sebastian Nagel commented on NUTCH-2959:
----------------------------------------

Hi [~tallison],

if running in local mode it might be a good option to delegate the parsing to a separate process. When running on a Hadoop cluster, it might cause some headaches to get the process running on the task nodes.

> Upgrade to Apache Tika 2.4.1
> ----------------------------
>
>                 Key: NUTCH-2959
>                 URL: https://issues.apache.org/jira/browse/NUTCH-2959
>             Project: Nutch
>          Issue Type: Task
>    Affects Versions: 1.19
>            Reporter: Markus Jelsma
>            Priority: Major
>             Fix For: 1.20
>
>         Attachments: NUTCH-2959.patch
>
>

--
This message was sent by Atlassian Jira
(v8.20.10#820010)