Re: [tesseract-ocr] Can tesseract be used to read a PDF and OCR it to text?

Shree Devi Kumar Sat, 18 Jan 2020 00:09:35 -0800

See
https://github.com/tesseract-ocr/tesseract/wiki/User-Projects-%E2%80%93-3rdParty


I have personally used gImageReader and VietOCR.
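
For the convert-then-OCR route quoted further down in this thread, here is a
minimal Python sketch. It assumes Poppler's pdftoppm and the tesseract
command-line tool are both installed and on the PATH. The function names and
the 300 DPI default are illustrative choices, not part of Tesseract or of any
wrapper listed on the wiki.

```python
import subprocess
import tempfile
from pathlib import Path


def pdftoppm_cmd(pdf_path, out_prefix, dpi=300):
    """Build the command that renders each PDF page to <out_prefix>-<n>.png."""
    # Assumption: Poppler's pdftoppm is installed; -r sets DPI, -png the format.
    return ["pdftoppm", "-r", str(dpi), "-png", str(pdf_path), str(out_prefix)]


def tesseract_cmd(image_path, lang="eng"):
    """Build the command that OCRs one image and prints the text to stdout."""
    # "stdout" as the output base makes tesseract write text to standard output.
    return ["tesseract", str(image_path), "stdout", "-l", lang]


def page_number(path):
    """Extract the page number from a name like page-07.png."""
    return int(path.stem.rsplit("-", 1)[1])


def ocr_pdf(pdf_path, lang="eng", dpi=300):
    """Render a PDF to page images, OCR each one, and return the joined text."""
    with tempfile.TemporaryDirectory() as tmp:
        prefix = Path(tmp) / "page"
        subprocess.run(pdftoppm_cmd(pdf_path, prefix, dpi), check=True)
        # Sort numerically so page 10 does not come before page 2.
        pages = sorted(Path(tmp).glob("page-*.png"), key=page_number)
        texts = []
        for page in pages:
            result = subprocess.run(
                tesseract_cmd(page, lang),
                check=True,
                capture_output=True,
                text=True,
            )
            texts.append(result.stdout)
    return "\n".join(texts)
```

Calling ocr_pdf("scan.pdf") (a hypothetical file name) would return the
recognized text of all pages as one string. The temporary page images are
deleted when the function returns.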

On Sat, Jan 18, 2020 at 4:34 AM 'pjfarley3' via tesseract-ocr <
[email protected]> wrote:

> At least as of today the "add ons" part of the wiki doesn't actually have
> a PDF-to-OCR'ed-text wrapper as far as I can see.
>
> Still searching for a solution, but thanks for trying to help.
>
> Peter
>
> On Monday, January 13, 2020 at 1:49:31 AM UTC-5, pjfarley3 wrote:
>>
>>
>>
>> On Sunday, January 12, 2020 at 8:52:51 PM UTC-5, shree wrote:
>>>
>>> Tesseract reads only image files, not PDF. You can convert the PDF to
>>> images (TIFF, PNG) and OCR those.
>>>
>>> Or use wrappers around Tesseract, which take a PDF and convert it to text.
>>> Look under add-ons in the wiki.
>>>
>>>
>> Thanks for that advice, I will check the wiki.
>>
>> Peter
>>


-- 

____________________________________________________________
भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com

