On Fri, 17 Oct 2014, Rick Leir wrote:
> I opened the jpg in Gimp, and you can see that it is about
> 100 pixels per text line:
> [gimpOriginal.png]
That image looks to be scanned at about 150 dpi. With
such faint characters, scanning at 300 or 600 dpi would
have been better. Anyway, try scaling the images up
by a factor of two. Also try an "adaptive binarization"
algorithm to convert to black and white. Google
"wolf binarization" for one example of such an
algorithm. I tried it myself on your example image, and
although it still didn't look that great, I can imagine
how bad it would look if a global threshold binarization
algorithm were used.
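
To illustrate the two suggestions above, here is a rough sketch in
plain Python. It is not Wolf's algorithm: it swaps in a much simpler
local-mean threshold, which is enough to show why an adaptive method
copes with faint text on an uneven background where a single global
threshold does not. The function names, the window radius, and the
offset are all made up for this example, and the image is assumed to
be a grayscale list of rows with values 0-255.

```python
def upscale_2x(img):
    """Nearest-neighbour 2x upscale of a grayscale image (list of rows)."""
    out = []
    for row in img:
        wide = [p for p in row for _ in range(2)]  # repeat each pixel twice
        out.append(wide)
        out.append(list(wide))                     # repeat each row twice
    return out


def integral_image(img):
    """Summed-area table with an extra zero row and column."""
    h, w = len(img), len(img[0])
    ii = [[0] * (w + 1) for _ in range(h + 1)]
    for y in range(h):
        run = 0
        for x in range(w):
            run += img[y][x]
            ii[y + 1][x + 1] = ii[y][x + 1] + run
    return ii


def adaptive_binarize(img, radius=7, offset=10):
    """Local-mean threshold (a simple stand-in, NOT Wolf's method).

    A pixel becomes black (0) when it is more than `offset` darker than
    the mean of the window around it, otherwise white (255).
    """
    h, w = len(img), len(img[0])
    ii = integral_image(img)
    out = []
    for y in range(h):
        y0, y1 = max(0, y - radius), min(h, y + radius + 1)
        row = []
        for x in range(w):
            x0, x1 = max(0, x - radius), min(w, x + radius + 1)
            total = ii[y1][x1] - ii[y0][x1] - ii[y1][x0] + ii[y0][x0]
            mean = total / ((y1 - y0) * (x1 - x0))
            row.append(0 if img[y][x] < mean - offset else 255)
        out.append(row)
    return out


def global_binarize(img, threshold=128):
    """Single fixed threshold for the whole image, for comparison."""
    return [[0 if p < threshold else 255 for p in row] for row in img]
```

To use it, call upscale_2x() on the grayscale image first and then
adaptive_binarize() on the result, with a radius on the order of the
stroke spacing at the new size. For real scans a smoother
interpolation than nearest-neighbour would probably be a better choice
for the scaling step.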
Rob Komar