Sochenda, It uses Ghostscript to convert PDF documents to images, which are then fed to Tess engine. Instructions on configuration are in the program's readme or published on its website.
Quan On Feb 7, 9:23 pm, KHEM Sochenda <[email protected]> wrote: > Dear Quan, > > I would like to know how to let tesseract OCR work with pdf documents. > > Thank you very much in advance for you kind response. > > With Best Regards, > > Sochenda > > On Tue, Feb 8, 2011 at 7:56 AM, Quan Nguyen <[email protected]> wrote: > > A Java/.NET GUI frontend for Tesseract OCR engine. The releases > > include the following fixes and improvements: > > > * Add support for spellcheck suggestion in context menu > > * Improve program accessibility and usability > > * Add support for downloading and installing language data packs and > > appropriate spell dictionaries > > * Add UI localization for Lithuanian and Slovak > > * Update Tesseract OCR engine to 3.01 (r551) (v3.1 only) > > >http://vietocr.sf.net > > > -- > > You received this message because you are subscribed to the Google Groups > > "tesseract-ocr" group. > > To post to this group, send email to [email protected]. > > To unsubscribe from this group, send email to > > [email protected]. > > For more options, visit this group at > >http://groups.google.com/group/tesseract-ocr?hl=en. -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To post to this group, send email to [email protected]. To unsubscribe from this group, send email to [email protected]. For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en.

