It seems to work here. I get the following result:

This is a test message. I
WO11Cl€1' if the OCR software
will be able to oohvert this
properly.
Jim

I called "tesseract.exe output.tif result -l eng"
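
For anyone who wants to script this call, here is a minimal Python sketch of the same command. The helper names (build_tesseract_command, run_tesseract) are made up for illustration, and it assumes the tesseract executable is on the PATH and writes its text to <output base>.txt, so adjust it to your setup.

```python
import shutil
import subprocess
from pathlib import Path


def build_tesseract_command(image, output_base, lang="eng", exe="tesseract"):
    """Build the argument list for: tesseract <image> <output_base> -l <lang>.

    Hypothetical helper; it only assembles the command quoted above.
    """
    return [exe, str(image), str(output_base), "-l", lang]


def run_tesseract(image, output_base, lang="eng", exe="tesseract"):
    """Run tesseract and return the recognized text.

    Assumption: tesseract writes its result to <output_base>.txt.
    """
    if shutil.which(exe) is None:
        raise FileNotFoundError(f"{exe!r} was not found on PATH")
    # check=True raises CalledProcessError if tesseract exits with an error.
    subprocess.run(build_tesseract_command(image, output_base, lang, exe), check=True)
    return Path(f"{output_base}.txt").read_text(encoding="utf-8", errors="replace")


if __name__ == "__main__":
    # Same file names as in the command above.
    print(run_tesseract("output.tif", "result"))
```

On Windows, pass exe="tesseract.exe" or the full path to the executable if it is not on the PATH.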

Regards


Lothar

www.dornieden.org


On 20 Feb., 02:46, "Bryan D. Payne" <[email protected]> wrote:
> I'm a newbie to tesseract and hoping that someone can help.  I'd like
> to convert a screen capture image to text.  Here are the steps that I'm
> taking:
>
> * create screen capture
> * crop the image so that only the text is visible, with a white
> background
> * upscale the image to 300dpi
> * convert to 8-bit tiff
> * process using "tesseract output.tif text-output -l eng"
>
> After getting poor results, I started bumping up the font size before
> taking the screen capture.  However, this hasn't helped.  Currently,
> I'm working from a 300dpi, high resolution image where the lower case
> characters are about 95 pixels high.  You can see the image at the
> following link (http://www.bryanpayne.org/tmp/output.tif), note that
> it is 9.5 MB.
>
> The results that I'm getting for this image look like this:
>
> 'I`11is is El, te
> vv<>11ciI @r if 1
> vvill  I;>€ 2LI;‘>14
> ];)I°()];)€I`1}7-
> —]irr1
>
> Clearly not very good.  So my question is, what am I doing wrong?  It
> seems that the source image is about as ideal as it gets.  Yet,
> tesseract is having a lot of trouble with it.  I suspect user
> error, so I'm hoping that someone can point me in the right
> direction.  Thanks!