At least as of today the "add ons" part of the wiki doesn't actually have a 
PDF-to-OCR'ed-text wrapper as far as I can see.

Still searching for a solution, but thanks for trying to help.

Peter

On Monday, January 13, 2020 at 1:49:31 AM UTC-5, pjfarley3 wrote:
>
>
>
> On Sunday, January 12, 2020 at 8:52:51 PM UTC-5, shree wrote:
>>
>> Tesseract reads only image files, not pdf. You can convert PDF to image 
>> (tif, png) and OCR those.
>>
>> Or use wrappers that use tesseract.which take a PDF and convert to text. 
>> Look under add-ons in wiki.
>>
>>
> Thanks for that advice, I will check the wiki.
>
> Peter
>

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/5ee49b3f-05dc-494e-959d-93039e9ba33f%40googlegroups.com.

Reply via email to