So, it looks like the text is too close to the edge. That is a known
problem. Perhaps you could manipulate the image to add space around
the image. ImageMagick and other libraries/tools can easily do that
with little CPU utilization.
--Sven


On Tue, Aug 30, 2011 at 10:08 AM, Rick Appleton <[email protected]> wrote:
> Hello all,
>
> I'm fairly new to Tesseract, so please forgive me if this is something
> that I can easily fix with a specific setting.
>
> I have two images which are extremely similar, yet give very different
> results.
>
> http://www.daedalus-development.net/ricka/Sheoldnd%20Whispering%20One.jpg
> http://www.daedalus-development.net/ricka/Sheoldred%20One.jpg
>
> The first image results in: 'Sheoldnd. Whispering One'
> The second image results in: 'Sheoldred.    One'
>
> The correct result should be: 'Sheoldred, Whispering One'
>
> The results in the first image are acceptable, and close enough for me
> to work with. However, the results from the second image are
> unacceptable to me. I appreciate that it has correctly detected the
> words it has found, but the fact that the middle word is missing
> entirely gives me lots of problems.
>
> Is this normal behaviour, or can I tweak Tesseract into giving me some
> kind of result for the middle word?
>
> Kind regards,
> Rick
>
> --
> You received this message because you are subscribed to the Google
> Groups "tesseract-ocr" group.
> To post to this group, send email to [email protected]
> To unsubscribe from this group, send email to
> [email protected]
> For more options, visit this group at
> http://groups.google.com/group/tesseract-ocr?hl=en
>



-- 
``All that is gold does not glitter,
  not all those who wander are lost;
the old that is strong does not wither,
  deep roots are not reached by the frost.
>From the ashes a fire shall be woken,
  a light from the shadows shall spring;
renewed shall be blade that was broken,
  the crownless again shall be king.”

-- 
You received this message because you are subscribed to the Google
Groups "tesseract-ocr" group.
To post to this group, send email to [email protected]
To unsubscribe from this group, send email to
[email protected]
For more options, visit this group at
http://groups.google.com/group/tesseract-ocr?hl=en

Reply via email to