Tika-Server - Tesseract - Output to PDF

Ralph Soika Wed, 24 Apr 2019 05:21:55 -0700

Hi,

I have a question about the Tesseract OCR Parser which is part of Tika:

Is it possible to set the output of Tesseract to PDF format? I think Tesseract supports this option to convert an image file (e.g. tif) into a searchable PDF file:

$ tesseract --tessdata-dir ./ ./testing/eurotext.png ./testing/eurotext-eng -l eng pdf
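
For illustration, the tesseract call above could be scripted roughly as follows. This is only a sketch that wraps the tesseract command line directly and does not go through Tika; the function names and default values are hypothetical, and it assumes the tesseract executable is on the PATH.

```python
import shutil
import subprocess
from pathlib import Path


def build_tesseract_pdf_command(image, output_base, lang="eng", tessdata_dir=None):
    """Return the argument list for a tesseract call that writes <output_base>.pdf.

    Hypothetical helper: it only mirrors the command line shown above.
    """
    cmd = ["tesseract"]
    if tessdata_dir is not None:
        # Optional location of the traineddata files.
        cmd += ["--tessdata-dir", str(tessdata_dir)]
    # Input image, output base name (without extension), language, and the
    # trailing "pdf" config name that selects searchable-PDF output.
    cmd += [str(image), str(output_base), "-l", lang, "pdf"]
    return cmd


def image_to_searchable_pdf(image, output_base, lang="eng", tessdata_dir=None):
    """Run tesseract and return the path of the PDF it is expected to write."""
    if shutil.which("tesseract") is None:
        raise RuntimeError("tesseract executable not found on PATH")
    cmd = build_tesseract_pdf_command(image, output_base, lang, tessdata_dir)
    # check=True raises CalledProcessError if tesseract exits with an error.
    subprocess.run(cmd, check=True)
    return Path(str(output_base) + ".pdf")
```

Calling image_to_searchable_pdf("./testing/eurotext.png", "./testing/eurotext-eng", tessdata_dir="./") would run the same command as above.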

I use the Tika REST API and I wonder how I can tell the Tika Server to create a PDF output file?



Thanks for any help


Ralph
