As of today, the "add-ons" section of the wiki does not list a PDF-to-OCR'ed-text wrapper, as far as I can see.
Still searching for a solution, but thanks for trying to help.

Peter

On Monday, January 13, 2020 at 1:49:31 AM UTC-5, pjfarley3 wrote:
>
> On Sunday, January 12, 2020 at 8:52:51 PM UTC-5, shree wrote:
>>
>> Tesseract reads only image files, not pdf. You can convert PDF to image
>> (tif, png) and OCR those.
>>
>> Or use wrappers that use tesseract, which take a PDF and convert to text.
>> Look under add-ons in wiki.
>>
> Thanks for that advice, I will check the wiki.
>
> Peter
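
For reference, the convert-then-OCR route suggested in the quoted reply could be sketched roughly as below. This is only an illustration, not a wrapper taken from the wiki: it assumes Poppler's `pdftoppm` and the `tesseract` command-line program are both installed and on the PATH, and the helper names (`pdftoppm_cmd`, `tesseract_cmd`, `page_number`, `ocr_pdf`) are made up for this sketch.

```python
import re
import subprocess
import tempfile
from pathlib import Path


def pdftoppm_cmd(pdf, prefix, dpi=300):
    """Build the command that renders each PDF page to <prefix>-<n>.png."""
    return ["pdftoppm", "-r", str(dpi), "-png", str(pdf), str(prefix)]


def tesseract_cmd(image, lang="eng"):
    """Build the command that OCRs one image and writes the text to stdout."""
    return ["tesseract", str(image), "stdout", "-l", lang]


def page_number(image):
    """Extract the page number that pdftoppm appended to the file name."""
    match = re.search(r"-(\d+)\.png$", Path(image).name)
    return int(match.group(1)) if match else 0


def ocr_pdf(pdf, lang="eng", dpi=300):
    """Render a PDF to PNG pages, OCR each page, and return the joined text."""
    with tempfile.TemporaryDirectory() as tmp:
        prefix = Path(tmp) / "page"
        # Step 1: PDF -> one PNG per page (assumes pdftoppm is installed).
        subprocess.run(pdftoppm_cmd(pdf, prefix, dpi), check=True)
        # Sort numerically so page 10 does not come before page 2.
        pages = sorted(Path(tmp).glob("page-*.png"), key=page_number)
        texts = []
        for image in pages:
            # Step 2: PNG -> text (assumes tesseract is installed).
            result = subprocess.run(
                tesseract_cmd(image, lang),
                check=True,
                capture_output=True,
                text=True,
            )
            texts.append(result.stdout)
    return "\n".join(texts)
```

Calling `print(ocr_pdf("scan.pdf"))` (with `scan.pdf` as a placeholder file name) would then print the recognized text for all pages. The 300 dpi default is just a common starting point for scanned text, not a value taken from this thread.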

