On 12 July 2010 18:35, rogerdpack <[email protected]> wrote:
> Hi all.  re-posting this in its own thread:
>
>
> Overall I'm having no success getting tesseract to decode this file
> that has a few digits on it, in either Linux or Windows.
>
> http://myfavoritepal.com/incoming/picture10.tif
>
> I am on XP, 2.04, 2.00 eng installed.  It can't tell black from grey,
> I assume?
>

I think the problem is the font size: the characters look to be made
of single-pixel lines, which tesseract just doesn't handle well (and
neither does anything else I've ever used, for that matter). I think
speckle detection is the cause of this, but that's just a hunch.

The image looks to have been generated; if you can control generation,
set a larger font size.


-- 
<Leftmost> jimregan, that's because deep inside you, you are evil.
<Leftmost> Also not-so-deep inside you.

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To post to this group, send email to [email protected].
To unsubscribe from this group, send email to 
[email protected].
For more options, visit this group at 
http://groups.google.com/group/tesseract-ocr?hl=en.

Reply via email to