Yes, you will need to convert the PDF to an image format that Tesseract accepts, such as TIFF or PNG. Third-party tools such as Ghostscript or ImageMagick can do the conversion.
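As a rough illustration of that two-step workflow, here is a minimal Python sketch that renders a PDF to a multipage TIFF with Ghostscript and then runs Tesseract on the result. It assumes the `gs` and `tesseract` executables are installed and on the PATH (on Windows the Ghostscript binary is usually named differently). The helper names `ghostscript_cmd`, `tesseract_cmd`, and `ocr_pdf` are made up for this example, and the `tiffgray` device and 300 dpi resolution are my own choices, not something Tesseract requires.

```python
import subprocess
import sys
from pathlib import Path


def ghostscript_cmd(pdf_path, tiff_path, dpi=300):
    """Build a Ghostscript command that renders all PDF pages into one multipage TIFF."""
    return [
        "gs",                      # assumption: Ghostscript is on the PATH under this name
        "-dNOPAUSE", "-dBATCH", "-dQUIET",
        "-sDEVICE=tiffgray",       # 8-bit grayscale TIFF; an illustrative choice
        f"-r{dpi}",                # rendering resolution in dpi
        f"-sOutputFile={tiff_path}",
        str(pdf_path),
    ]


def tesseract_cmd(tiff_path, out_base, lang="eng"):
    """Build a Tesseract command; the text is written to <out_base>.txt."""
    return ["tesseract", str(tiff_path), str(out_base), "-l", lang]


def ocr_pdf(pdf_path, lang="eng"):
    """Convert the PDF to TIFF, OCR it, and return the path of the text file."""
    pdf = Path(pdf_path)
    tiff = pdf.with_suffix(".tif")
    out_base = pdf.with_suffix("")
    subprocess.run(ghostscript_cmd(pdf, tiff), check=True)
    subprocess.run(tesseract_cmd(tiff, out_base, lang), check=True)
    return Path(str(out_base) + ".txt")


if __name__ == "__main__":
    if len(sys.argv) != 2:
        sys.exit("usage: python ocr_pdf.py scan.pdf")
    print(ocr_pdf(sys.argv[1]))
```

Running it as `python ocr_pdf.py scan.pdf` should leave `scan.tif` and `scan.txt` next to the input file. For German scans, pass `lang="deu"` to `ocr_pdf`, provided that language data is installed.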
On Thursday, September 15, 2016 at 2:40:55 PM UTC-5, Simon Eigeldinger wrote:
> Hi all,
>
> Can I use Tesseract to open multipage PDFs directly?
> We have multifunction printers which produce PDFs with images which can
> be run through OCR.
>
> How can I accomplish that with Tesseract?
> Do I need a second program for that?
>
> Greetings,
> Simon

