You'll need to upscale the image before reducing it to black and white. The black-and-white reduction itself isn't essential.
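
Something along these lines might serve as a starting point. It is only a sketch: the 300% scale factor, the grayscale step, and the ocr_page helper name are my own assumptions, not values tested against your page, and -compress None is there on the assumption that your tesseract build wants uncompressed TIFF.

```shell
#!/bin/sh
# Sketch: enlarge a scanned page first, then hand it to tesseract.
# SCALE is an assumed starting value; override it in the environment to experiment.
SCALE="${SCALE:-300%}"

# ocr_page is a hypothetical helper name, not part of ImageMagick or tesseract.
ocr_page() {
    base="${1%.jpg}"
    # Upscale, then convert to 8-bit grayscale rather than forcing 2-bit depth.
    convert "$1" -resize "$SCALE" -colorspace Gray -depth 8 -compress None "$base.tif" &&
    tesseract "$base.tif" "$base" -l fra
}

# Only run when a file name is given on the command line.
if [ "$#" -gt 0 ]; then
    ocr_page "$1"
fi
```

Saved as, say, ocr-page.sh, it would be run as "sh ocr-page.sh page-4.jpg" and should leave the text in page-4.txt. If the result is still empty, try a different SCALE before adding any thresholding.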

fontenot.1031 wrote, On 2010-07-03 01:23:
Hey. I have a bunch of .jpg files of the pages of the book L'Etranger
that I need to OCR. However, when I convert them into a .tif file so
that tesseract can read them it doesn't read anything (even though the
text is fairly clear).
I'm using this to convert the .jpg files into .tif files:

convert page-4.jpg -depth 2 page-4.tif

Then when I execute: tesseract page-4.tif page-4 -l fra I just get a
text file with two empty lines.

Here's a link to the exact .jpg I'm using:
http://imgur.com/j7f5E.jpg

Does anyone know what I might be doing wrong?


--
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
For more options, visit this group at 
http://groups.google.com/group/tesseract-ocr?hl=en.
