On 5 July 2010 22:00, fontenot.1031 <[email protected]> wrote: > My question is: are they any other better options to use when > converting from pdf to .jpg? > >> it's quite likely that the resolution was chosen specifically so nobody >> would be able to use OCR on the scans. > > The original PDF is of high quality. Here's a link to it: > http://www.lecanardduloir.com/Docs/CamusLetranger.pdf
Just use pdfimages then (it comes with xpdf), and use ImageMagick's convert to convert from pbm to tiff. The PDF as is looks like it's ideal for OCR (and the pbm images extracted will be the same). -- <Leftmost> jimregan, that's because deep inside you, you are evil. <Leftmost> Also not-so-deep inside you. -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To post to this group, send email to [email protected]. To unsubscribe from this group, send email to [email protected]. For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en.

