[GitHub] [nutch] sebastian-nagel commented on pull request #776: NUTCH-2959 -- upgrade Tika to 2.9.0

2023-09-29 Thread via GitHub
sebastian-nagel commented on PR #776:
URL: https://github.com/apache/nutch/pull/776#issuecomment-1740397046
> what do I use for the tika seeds file? Are you using our github repo, or the
> tika-parsers-common package specifically see the comments in

[GitHub] [nutch] sebastian-nagel commented on pull request #776: NUTCH-2959 -- upgrade Tika to 2.9.0

2023-09-26 Thread via GitHub
sebastian-nagel commented on PR #776:
URL: https://github.com/apache/nutch/pull/776#issuecomment-1736008780
> I suggest that we downgrade to Tika 2.2.1 to fix that regression.
Good point, @lewismc. I've opened NUTCH-3006 for that.

[GitHub] [nutch] sebastian-nagel commented on pull request #776: NUTCH-2959 -- upgrade Tika to 2.9.0

2023-09-19 Thread via GitHub
sebastian-nagel commented on PR #776:
URL: https://github.com/apache/nutch/pull/776#issuecomment-1725795918
> Can we exclude commons-io from hadoop and then add it as a dependency in the main ivy.xml?
When running in distributed or pseudo-distributed mode, commons-io 2.8.0 is first