sebastian-nagel commented on PR #849: URL: https://github.com/apache/nutch/pull/849#issuecomment-2684136660
> upgrade tika to 3.1.0 We currently use a shaded Tika package (2.9.1.0, thanks @tballison!) because of a conflict with the commons-io version required by Tika (or POI) and provided by Hadoop, see NUTCH-2959. Upgrading will force everybody to use at least Hadoop 3.4.0 in distributed mode. @maciejpuzianowski, could you provide the Hadoop version of your cluster? This may help to reproduce the issue and test alternative solutions, such as an upgrade to a more recent version of Tika. Thanks! -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: dev-unsubscr...@nutch.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org