On 12 July 2010 18:35, rogerdpack <[email protected]> wrote:
> Hi all. re-posting this in its own thread:
>
> Overall I'm having no success getting tesseract to decode this file
> that has a few digits on it, in either Linux or Windows.
>
> http://myfavoritepal.com/incoming/picture10.tif
>
> I am on XP, 2.04, 2.00 eng installed. It can't tell black from grey,
> I assume?
>

I think the problem is the font size: the characters look to be made of single-pixel lines, which tesseract just doesn't handle well (and neither does anything else I've ever used, for that matter). I suspect speckle detection is what trips over strokes that thin, but that's just a hunch.

The image looks to have been generated; if you can control the generation, set a larger font size.
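
If regenerating the image at a larger font size isn't possible, a different workaround worth trying is to upscale the existing image before running OCR on it, so that one-pixel strokes become several pixels wide. The snippet below is only a sketch of that idea in plain Python, not anything tesseract does itself: it assumes the image is already a binary bitmap held as a list of rows (1 = ink, 0 = background), and the names `upscale` and `min_stroke_width` are made up for illustration. Whether this actually helps on the image in question is untested.

```python
def upscale(bitmap, factor):
    """Nearest-neighbour upscale: each pixel becomes a factor x factor block."""
    if factor < 1:
        raise ValueError("factor must be >= 1")
    out = []
    for row in bitmap:
        # Repeat every pixel horizontally, then repeat the widened row vertically.
        wide = [px for px in row for _ in range(factor)]
        out.extend(list(wide) for _ in range(factor))
    return out


def min_stroke_width(bitmap):
    """Shortest horizontal run of ink (1) pixels in any row, or 0 if no ink."""
    shortest = 0
    for row in bitmap:
        run = 0
        for px in row + [0]:  # trailing 0 closes a run that reaches the edge
            if px:
                run += 1
            elif run:
                shortest = run if shortest == 0 else min(shortest, run)
                run = 0
    return shortest


# A crude "1" drawn with single-pixel lines (hypothetical test glyph).
glyph = [
    [0, 1, 0],
    [1, 1, 0],
    [0, 1, 0],
    [0, 1, 0],
    [1, 1, 1],
]

bigger = upscale(glyph, 3)
print(min_stroke_width(glyph), "->", min_stroke_width(bigger))  # 1 -> 3
```

For a real image file you would do the resize with an image library or editor rather than by hand; the point is only that the thinnest strokes should end up more than one pixel wide before the OCR engine sees them.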

