See https://github.com/tesseract-ocr/tesseract/wiki/User-Projects-%E2%80%93-3rdParty
I have personally used gImageReader and Vietocr. On Sat, Jan 18, 2020 at 4:34 AM 'pjfarley3' via tesseract-ocr < [email protected]> wrote: > At least as of today the "add ons" part of the wiki doesn't actually have > a PDF-to-OCR'ed-text wrapper as far as I can see. > > Still searching for a solution, but thanks for trying to help. > > Peter > > On Monday, January 13, 2020 at 1:49:31 AM UTC-5, pjfarley3 wrote: >> >> >> >> On Sunday, January 12, 2020 at 8:52:51 PM UTC-5, shree wrote: >>> >>> Tesseract reads only image files, not pdf. You can convert PDF to image >>> (tif, png) and OCR those. >>> >>> Or use wrappers that use tesseract.which take a PDF and convert to text. >>> Look under add-ons in wiki. >>> >>> >> Thanks for that advice, I will check the wiki. >> >> Peter >> > -- > You received this message because you are subscribed to the Google Groups > "tesseract-ocr" group. > To unsubscribe from this group and stop receiving emails from it, send an > email to [email protected]. > To view this discussion on the web visit > https://groups.google.com/d/msgid/tesseract-ocr/5ee49b3f-05dc-494e-959d-93039e9ba33f%40googlegroups.com > <https://groups.google.com/d/msgid/tesseract-ocr/5ee49b3f-05dc-494e-959d-93039e9ba33f%40googlegroups.com?utm_medium=email&utm_source=footer> > . > -- ____________________________________________________________ भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/CAG2NduWXq3MK%2BnAkL6FbQTeXmDAKeix3C5ZT7MTQHQFPYDBTMw%40mail.gmail.com.

