Re: [tesseract-ocr] Can tesseract be used to read a PDF and OCR it to text?

'pjfarley3' via tesseract-ocr Fri, 17 Jan 2020 15:04:57 -0800

At least as of today the "add ons" part of the wiki doesn't actually have a 
PDF-to-OCR'ed-text wrapper as far as I can see.


Still searching for a solution, but thanks for trying to help.

Peter

On Monday, January 13, 2020 at 1:49:31 AM UTC-5, pjfarley3 wrote:
>
>
>
> On Sunday, January 12, 2020 at 8:52:51 PM UTC-5, shree wrote:
>>
>> Tesseract reads only image files, not pdf. You can convert PDF to image 
>> (tif, png) and OCR those.
>>
>> Or use wrappers that use tesseract.which take a PDF and convert to text. 
>> Look under add-ons in wiki.
>>
>>
> Thanks for that advice, I will check the wiki.
>
> Peter
>

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/5ee49b3f-05dc-494e-959d-93039e9ba33f%40googlegroups.com.

Re: [tesseract-ocr] Can tesseract be used to read a PDF and OCR it to text?

Reply via email to