The current source of Nutch uses Tika 1.7 as per repository in github. ( https://github.com/apache/nutch/commit/3e2e688bd097727f457f1aa882c74a128f0a53da ) As per Apache Tika 1.7 webpage, Tika 1.7 includes GDAL and Tesseract OCR (installation required). But the Nutch source does not have GDAL and Tesseract OCR in parse-tika plugin.
How to include GDAL and Tesseract OCR sources in Tika plugin for Nutch?

