If you distribute to Windows users, you could compile tesseract in a static
way, bundled with leptonica and MS VC++ Redistributabe dlls. So you could
embed tesseract in a folder of your app and configure that info into
ParserContext:

TesseractOCRConfig ocrConfig = new TesseractOCRConfig();
config.setTesseractPath(path_to_tesseract);
config.setTessdataPath(path_to_tessdata);
parseContext.set(TesseractOCRConfig .class, ocrConfig);
tesseractOCRParser.parse(inputStream, metadata, handler, parseContext);

Of course it is better to use or create your own docker image with all
dependencies and environment variables configured properly.

Regards,
Luis


2017-06-13 8:46 GMT-03:00 Allison, Timothy B. <[email protected]>:

> Agreed.  Thank you, David.  Y, you need to install tesseract yourself, and
> it has to be callable via the commandline (or you need to tell Tika where
> to find it).
>
> Dave Meikle has a docker container for tika-server with tesseract already
> installed: https://github.com/LogicalSpark/docker-tikaserver
>
>
>
> *From:* David Pilato [mailto:[email protected]]
> *Sent:* Tuesday, June 13, 2017 6:21 AM
> *To:* [email protected]
> *Subject:* Re: How to use TesseractOCRParser etc. in Apache Tika 1.14
> without installing tesseract-ocr separately on system
>
>
>
> I don't believe you can do OCR today without installing Tesseract.
>
>
> Le 13 juin 2017 à 12:11 +0200, Achint Satsangi <[email protected]>,
> a écrit :
>
> Dear All
>
> Greetings to the amazing Tika community !
>
> I want to share my java application which performs OCR on images using
> Apache Tika 1.14 using TesseractOCRParser() etc. but I found that without
> installing tesseract-ocr on my system (Ubuntu 14.04) the OCR functionality
> was not working
>
> I need to find out how can I do away the need of installation of
> tesseract-ocr on the  systems for TesseractOCRParser() and other OCR
> functions to work fine so I can share my work with others without them
> needing to install tesseract separately
>
> Thanks in advance
>
> Best Regards
>
> Achint
>
>

Reply via email to