On Thu, 10 Jun 2021, Cristian Zamfir wrote:
It would be nice if this was feasible via the headers of each request. I find it more convenient to use if/else in my code than in the yaml files used for k8s configuration. Is there such an option?

Three options, see https://cwiki.apache.org/confluence/display/TIKA/TikaOCR#TikaOCR-DisableOCRinTikadisable-ocr
 * Don't install tesseract on the machine hosting Tika
 * Supply a Tika Config file that disables the Tesseract parser
 * Send the Server the custom header X-Tika-OCRskipOcr: true

Nick

Reply via email to