The current Nutch source uses Tika 1.7, as per the repository on GitHub
(https://github.com/apache/nutch/commit/3e2e688bd097727f457f1aa882c74a128f0a53da).
According to the Apache Tika 1.7 webpage, Tika 1.7 includes GDAL and
Tesseract OCR support (installation required).
However, the Nutch source does not include GDAL or Tesseract OCR in the
parse-tika plugin.

How can I include the GDAL and Tesseract OCR sources in the Tika plugin for Nutch?
