Hi all Thanks for the information.
I have increased the DPI also but some word are missing attached output image. I have attached the image properties. the file compression type CCITT and bit depth is 1. Does compression type and bit depth is depended on OCR process? Looking forward your reply. Regards Guna On Tuesday, June 23, 2015 at 12:03:40 AM UTC+5:30, Art Rhyno wrote: > > Hi Guna, > > > > I usually find that tesseract has trouble with text on lines in a form, > there is a horizontal line removal example included with leptonica that > might help you [1]. I tried it on the sample you provided, and doubled the > size of the image to start zeroing in on the results. You might also > consider font training for characters that would be impacted by removing > the line (since it can take the bottom part of the letter away if the text > is typed right on the line). > > > > art > > --- > > 1. http://www.leptonica.com/line-removal.html > > > > *From:* [email protected] <javascript:> [mailto: > [email protected] <javascript:>] *On Behalf Of *Gunasekaran Velu > *Sent:* Monday, June 22, 2015 11:06 AM > *To:* [email protected] <javascript:> > *Subject:* [tesseract-ocr] Re: Improve OCR accuracy > > > > > > Hi > > > > Thanks for the reply. > > > > I am using Tesseract .NET Wrapper version 2.0.4.0. > > > > Looking forward your reply. > > > > Regards > > Guna > > On Monday, June 22, 2015 at 6:41:06 PM UTC+5:30, supriya Das wrote: > > Which version of Tesseract are you using ? > > On Monday, 22 June 2015 17:26:51 UTC+5:30, Gunasekaran Velu wrote: > > > > > > HI > > > > I have attached the image as well as Tesseract OCR result for attached > image screen shot. the below OCR some words are missing from OCR how can i > improve the image quality to detect the missing words. > > > > The attached image DPI are > > > > Horizontal resolution - 204 DPI > > Vertical resolution - 98 DPI > > > > Please help me to improve the OCR accuracy. > > > > Looking forward your reply. > > > > Regards > > Guna > > > > *Error! Filename not specified.* > > -- > You received this message because you are subscribed to the Google Groups > "tesseract-ocr" group. > To unsubscribe from this group and stop receiving emails from it, send an > email to [email protected] <javascript:>. > To post to this group, send email to [email protected] > <javascript:>. > Visit this group at http://groups.google.com/group/tesseract-ocr. > To view this discussion on the web visit > https://groups.google.com/d/msgid/tesseract-ocr/e725e8e6-dd6f-4c4c-9bb9-61f86c49053c%40googlegroups.com > > <https://groups.google.com/d/msgid/tesseract-ocr/e725e8e6-dd6f-4c4c-9bb9-61f86c49053c%40googlegroups.com?utm_medium=email&utm_source=footer> > . > For more options, visit https://groups.google.com/d/optout. > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To post to this group, send email to [email protected]. Visit this group at http://groups.google.com/group/tesseract-ocr. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/23e22348-cf56-414c-a2f4-6397d4c18bf0%40googlegroups.com. For more options, visit https://groups.google.com/d/optout.

