My answer can add to Sven's. Although close edges can be indeed a
problem, and Tesseract feels better when a bigger background area
surrounds the target text, probably you'd want to pay attention to the
lower edge of the "Sheoldred One.jpg" image. There's a thin dark line
along the edge - that's the main difference between your images. This
line can confuse Tesseract and be the reason of different recognition
results. Passing as clean as possible images to Tesseract would let
you achieve better recognition.

HTH

Warm regards,
Dmitri Silaev
www.CustomOCR.com





On Tue, Aug 30, 2011 at 7:08 PM, Rick Appleton <[email protected]> wrote:
> Hello all,
>
> I'm fairly new to Tesseract, so please forgive me if this is something
> that I can easily fix with a specific setting.
>
> I have two images which are extremely similar, yet give very different
> results.
>
> http://www.daedalus-development.net/ricka/Sheoldnd%20Whispering%20One.jpg
> http://www.daedalus-development.net/ricka/Sheoldred%20One.jpg
>
> The first image results in: 'Sheoldnd. Whispering One'
> The second image results in: 'Sheoldred.    One'
>
> The correct result should be: 'Sheoldred, Whispering One'
>
> The results in the first image are acceptable, and close enough for me
> to work with. However, the results from the second image are
> unacceptable to me. I appreciate that it has correctly detected the
> words it has found, but the fact that the middle word is missing
> entirely gives me lots of problems.
>
> Is this normal behaviour, or can I tweak Tesseract into giving me some
> kind of result for the middle word?
>
> Kind regards,
> Rick
>
> --
> You received this message because you are subscribed to the Google
> Groups "tesseract-ocr" group.
> To post to this group, send email to [email protected]
> To unsubscribe from this group, send email to
> [email protected]
> For more options, visit this group at
> http://groups.google.com/group/tesseract-ocr?hl=en
>

-- 
You received this message because you are subscribed to the Google
Groups "tesseract-ocr" group.
To post to this group, send email to [email protected]
To unsubscribe from this group, send email to
[email protected]
For more options, visit this group at
http://groups.google.com/group/tesseract-ocr?hl=en

Reply via email to