Agreed. Thank you, David. Y, you need to install tesseract yourself, and it has to be callable via the commandline (or you need to tell Tika where to find it). Dave Meikle has a docker container for tika-server with tesseract already installed: https://github.com/LogicalSpark/docker-tikaserver
From: David Pilato [mailto:[email protected]] Sent: Tuesday, June 13, 2017 6:21 AM To: [email protected] Subject: Re: How to use TesseractOCRParser etc. in Apache Tika 1.14 without installing tesseract-ocr separately on system I don't believe you can do OCR today without installing Tesseract. Le 13 juin 2017 à 12:11 +0200, Achint Satsangi <[email protected]<mailto:[email protected]>>, a écrit : Dear All Greetings to the amazing Tika community ! I want to share my java application which performs OCR on images using Apache Tika 1.14 using TesseractOCRParser() etc. but I found that without installing tesseract-ocr on my system (Ubuntu 14.04) the OCR functionality was not working I need to find out how can I do away the need of installation of tesseract-ocr on the systems for TesseractOCRParser() and other OCR functions to work fine so I can share my work with others without them needing to install tesseract separately Thanks in advance Best Regards Achint
