Hi Mr. Sriranga,
I think maybe because his files are JPEG the conversion to BMP
produced a low quality file. Perhaps if you use a manual conversion --
not copy and paste -- it will produce different results? I'm pretty
sure the dark line at the bottom and the lack of border / margins are
the problem, along with perhaps the dark gray background color.
--Sven


On Tue, Aug 30, 2011 at 10:18 PM, Sriranga(78yrsold)
<[email protected]> wrote:
> i copied the image and saved in paintbrush as bmp wne tested using r527
> output text is blank.
>
> On Tue, Aug 30, 2011 at 10:27 PM, Dmitri Silaev <[email protected]>
> wrote:
>>
>> My answer can add to Sven's. Although close edges can be indeed a
>> problem, and Tesseract feels better when a bigger background area
>> surrounds the target text, probably you'd want to pay attention to the
>> lower edge of the "Sheoldred One.jpg" image. There's a thin dark line
>> along the edge - that's the main difference between your images. This
>> line can confuse Tesseract and be the reason of different recognition
>> results. Passing as clean as possible images to Tesseract would let
>> you achieve better recognition.
>>
>> HTH
>>
>> Warm regards,
>> Dmitri Silaev
>> www.CustomOCR.com
>>
>>
>>
>>
>>
>> On Tue, Aug 30, 2011 at 7:08 PM, Rick Appleton <[email protected]>
>> wrote:
>> > Hello all,
>> >
>> > I'm fairly new to Tesseract, so please forgive me if this is something
>> > that I can easily fix with a specific setting.
>> >
>> > I have two images which are extremely similar, yet give very different
>> > results.
>> >
>> >
>> > http://www.daedalus-development.net/ricka/Sheoldnd%20Whispering%20One.jpg
>> > http://www.daedalus-development.net/ricka/Sheoldred%20One.jpg
>> >
>> > The first image results in: 'Sheoldnd. Whispering One'
>> > The second image results in: 'Sheoldred.    One'
>> >
>> > The correct result should be: 'Sheoldred, Whispering One'
>> >
>> > The results in the first image are acceptable, and close enough for me
>> > to work with. However, the results from the second image are
>> > unacceptable to me. I appreciate that it has correctly detected the
>> > words it has found, but the fact that the middle word is missing
>> > entirely gives me lots of problems.
>> >
>> > Is this normal behaviour, or can I tweak Tesseract into giving me some
>> > kind of result for the middle word?
>> >
>> > Kind regards,
>> > Rick
>> >
>> > --
>> > You received this message because you are subscribed to the Google
>> > Groups "tesseract-ocr" group.
>> > To post to this group, send email to [email protected]
>> > To unsubscribe from this group, send email to
>> > [email protected]
>> > For more options, visit this group at
>> > http://groups.google.com/group/tesseract-ocr?hl=en
>> >
>>
>> --
>> You received this message because you are subscribed to the Google
>> Groups "tesseract-ocr" group.
>> To post to this group, send email to [email protected]
>> To unsubscribe from this group, send email to
>> [email protected]
>> For more options, visit this group at
>> http://groups.google.com/group/tesseract-ocr?hl=en
>
> --
> You received this message because you are subscribed to the Google
> Groups "tesseract-ocr" group.
> To post to this group, send email to [email protected]
> To unsubscribe from this group, send email to
> [email protected]
> For more options, visit this group at
> http://groups.google.com/group/tesseract-ocr?hl=en
>



-- 
``All that is gold does not glitter,
  not all those who wander are lost;
the old that is strong does not wither,
  deep roots are not reached by the frost.
>From the ashes a fire shall be woken,
  a light from the shadows shall spring;
renewed shall be blade that was broken,
  the crownless again shall be king.”

-- 
You received this message because you are subscribed to the Google
Groups "tesseract-ocr" group.
To post to this group, send email to [email protected]
To unsubscribe from this group, send email to
[email protected]
For more options, visit this group at
http://groups.google.com/group/tesseract-ocr?hl=en

Reply via email to