Hi Mr. Sriranga, I think maybe because his files are JPEG the conversion to BMP produced a low quality file. Perhaps if you use a manual conversion -- not copy and paste -- it will produce different results? I'm pretty sure the dark line at the bottom and the lack of border / margins are the problem, along with perhaps the dark gray background color. --Sven
On Tue, Aug 30, 2011 at 10:18 PM, Sriranga(78yrsold) <[email protected]> wrote: > i copied the image and saved in paintbrush as bmp wne tested using r527 > output text is blank. > > On Tue, Aug 30, 2011 at 10:27 PM, Dmitri Silaev <[email protected]> > wrote: >> >> My answer can add to Sven's. Although close edges can be indeed a >> problem, and Tesseract feels better when a bigger background area >> surrounds the target text, probably you'd want to pay attention to the >> lower edge of the "Sheoldred One.jpg" image. There's a thin dark line >> along the edge - that's the main difference between your images. This >> line can confuse Tesseract and be the reason of different recognition >> results. Passing as clean as possible images to Tesseract would let >> you achieve better recognition. >> >> HTH >> >> Warm regards, >> Dmitri Silaev >> www.CustomOCR.com >> >> >> >> >> >> On Tue, Aug 30, 2011 at 7:08 PM, Rick Appleton <[email protected]> >> wrote: >> > Hello all, >> > >> > I'm fairly new to Tesseract, so please forgive me if this is something >> > that I can easily fix with a specific setting. >> > >> > I have two images which are extremely similar, yet give very different >> > results. >> > >> > >> > http://www.daedalus-development.net/ricka/Sheoldnd%20Whispering%20One.jpg >> > http://www.daedalus-development.net/ricka/Sheoldred%20One.jpg >> > >> > The first image results in: 'Sheoldnd. Whispering One' >> > The second image results in: 'Sheoldred. One' >> > >> > The correct result should be: 'Sheoldred, Whispering One' >> > >> > The results in the first image are acceptable, and close enough for me >> > to work with. However, the results from the second image are >> > unacceptable to me. I appreciate that it has correctly detected the >> > words it has found, but the fact that the middle word is missing >> > entirely gives me lots of problems. >> > >> > Is this normal behaviour, or can I tweak Tesseract into giving me some >> > kind of result for the middle word? >> > >> > Kind regards, >> > Rick >> > >> > -- >> > You received this message because you are subscribed to the Google >> > Groups "tesseract-ocr" group. >> > To post to this group, send email to [email protected] >> > To unsubscribe from this group, send email to >> > [email protected] >> > For more options, visit this group at >> > http://groups.google.com/group/tesseract-ocr?hl=en >> > >> >> -- >> You received this message because you are subscribed to the Google >> Groups "tesseract-ocr" group. >> To post to this group, send email to [email protected] >> To unsubscribe from this group, send email to >> [email protected] >> For more options, visit this group at >> http://groups.google.com/group/tesseract-ocr?hl=en > > -- > You received this message because you are subscribed to the Google > Groups "tesseract-ocr" group. > To post to this group, send email to [email protected] > To unsubscribe from this group, send email to > [email protected] > For more options, visit this group at > http://groups.google.com/group/tesseract-ocr?hl=en > -- ``All that is gold does not glitter, not all those who wander are lost; the old that is strong does not wither, deep roots are not reached by the frost. >From the ashes a fire shall be woken, a light from the shadows shall spring; renewed shall be blade that was broken, the crownless again shall be king.” -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To post to this group, send email to [email protected] To unsubscribe from this group, send email to [email protected] For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en

