It looks like the text is too close to the edge of the image, which is a known problem. You could pad the image so that there is some empty space around the text. ImageMagick and other libraries/tools can do that easily and with little CPU utilization.

--Sven
On Tue, Aug 30, 2011 at 10:08 AM, Rick Appleton <[email protected]> wrote:
> Hello all,
>
> I'm fairly new to Tesseract, so please forgive me if this is something
> that I can easily fix with a specific setting.
>
> I have two images which are extremely similar, yet give very different
> results.
>
> http://www.daedalus-development.net/ricka/Sheoldnd%20Whispering%20One.jpg
> http://www.daedalus-development.net/ricka/Sheoldred%20One.jpg
>
> The first image results in: 'Sheoldnd. Whispering One'
> The second image results in: 'Sheoldred. One'
>
> The correct result should be: 'Sheoldred, Whispering One'
>
> The results in the first image are acceptable, and close enough for me
> to work with. However, the results from the second image are
> unacceptable to me. I appreciate that it has correctly detected the
> words it has found, but the fact that the middle word is missing
> entirely gives me lots of problems.
>
> Is this normal behaviour, or can I tweak Tesseract into giving me some
> kind of result for the middle word?
>
> Kind regards,
> Rick
>
> --
> You received this message because you are subscribed to the Google
> Groups "tesseract-ocr" group.
> To post to this group, send email to [email protected]
> To unsubscribe from this group, send email to
> [email protected]
> For more options, visit this group at
> http://groups.google.com/group/tesseract-ocr?hl=en
>

--
"All that is gold does not glitter,
not all those who wander are lost;
the old that is strong does not wither,
deep roots are not reached by the frost.
From the ashes a fire shall be woken,
a light from the shadows shall spring;
renewed shall be blade that was broken,
the crownless again shall be king."

