On Mon, Apr 29, 2013 at 04:10:43AM -0700, Steven McArdle wrote: > What do you mean by "it doesn't support straight PDF" ?
I mean it only accepts image files. So you need to extract the images from the PDF before getting Tesseract to process them. Now I think of it, the 'pdfimages' tool is better for this than imagemagick, as it will extract without converting or losing any quality. But either would work fine (or Ghostscript, as you point out). Nick -- -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To post to this group, send email to [email protected] To unsubscribe from this group, send email to [email protected] For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en --- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. For more options, visit https://groups.google.com/groups/opt_out.

