IMHO you're going in the wrong direction. Thresholding won't help.
I thresholded the image manually in Photoshop, and Tesseract's
result was the same as for the original image.

You need to get the image at a higher resolution. Yours is only 72 DPI,
which is not enough for Tesseract. In plain English, you need
more pixels per letter.
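
To put rough numbers on "pixels per letter", here is a minimal sketch of the arithmetic. It assumes 10 pt body text (a guess, I have not measured the font in your file) and uses 300 DPI as the comparison target, since that is the figure commonly suggested for OCR input. The function names are made up for illustration. One point is 1/72 inch, so the nominal font height in pixels is simply point size times DPI divided by 72.

```python
# Rough arithmetic only: how many pixels tall is the font's nominal
# height (its point size) at a given scan resolution?
# Assumptions: 10 pt text and a 300 DPI target; neither comes from
# the actual image.

POINTS_PER_INCH = 72  # definition of the typographic point


def em_height_px(point_size, dpi):
    """Nominal font height in pixels for text of point_size at dpi."""
    return point_size * dpi / POINTS_PER_INCH


def scale_factor(current_dpi, target_dpi):
    """Linear factor between two resolutions for the same page."""
    return target_dpi / current_dpi


if __name__ == "__main__":
    for dpi in (72, 150, 300):
        print(f"{dpi:>3} DPI: {em_height_px(10, dpi):.1f} px per 10 pt letter box")
    print(f"72 -> 300 DPI is a factor of {scale_factor(72, 300):.2f} per side")
```

Actual glyphs are smaller than the nominal height, so a lowercase letter at 72 DPI is only a handful of pixels tall. Note that the factor is what a rescan would have to deliver. Resampling the existing 72 DPI file up by that factor enlarges the letters but does not recover detail the scan never captured.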

Warm regards,
Dmitri Silaev





On Tue, Mar 29, 2011 at 7:15 PM, Robert P. J. Day <[email protected]> wrote:
>
>  hoping this query isn't wildly off-topic but i have an 8-bit B/W tif
> file (attached), which gthumb shows me to be eminently readable on my
> ubuntu system.  at this point, i'd like to use one of ubuntu's
> libtiff-tools utilities to convert it to the ideal 2-level B/W tif
> file that tesseract prefers, and i was playing with the "tiffdither"
> utility, but no matter what threshold i use, i can't get a clean B/W
> representation of what appears to be a perfectly legible 8-bit file.
>
>  perhaps i just don't understand what the threshold means WRT to
> dithering.  what would the proper step be to transform the attached
> file into the obvious single-bit tif equivalent?  thanks.
>
> rday
>
> --
>
> ========================================================================
> Robert P. J. Day                               Waterloo, Ontario, CANADA
>                        http://crashcourse.ca
>
> Twitter:                                       http://twitter.com/rpjday
> LinkedIn:                               http://ca.linkedin.com/in/rpjday
> ========================================================================
>
> --
> You received this message because you are subscribed to the Google Groups 
> "tesseract-ocr" group.
> To post to this group, send email to [email protected].
> To unsubscribe from this group, send email to 
> [email protected].
> For more options, visit this group at 
> http://groups.google.com/group/tesseract-ocr?hl=en.
>
>

