Thank you very much for your valuable suggestion. Could you help me remove the horizontal lines? I am processing this image in C# code. Is there a tool I can use to remove the lines, or a code snippet I can refer to?
On Saturday, August 27, 2016 at 10:04:10 PM UTC+5:30, Quan Nguyen wrote:
> Deskew, grayscale, remove lines, binarize produced the image:
>
> <https://lh3.googleusercontent.com/-k4IAE2W2W7M/V8HAYJhIP5I/AAAAAAAAAqg/C85uxC7JDOMikMfAX_whlGB8UBU2Y1BiACLcB/s1600/Capture4.PNG>
>
> and OCRed text:
>
> l4|0|0l2|1l1>°l0|7l
>
> So if you could remove the vertical lines, it would improve further.
>
> On Saturday, August 27, 2016 at 10:29:52 AM UTC-5, shripad shirsat wrote:
>> I am facing to issue to recognize the numbers from pdf which are printed
>> within the boxes. I have used tesseract in C# for my project. Kindly some
>> one help me out with any clue or hint or a snippet to how to go about to
>> find the solution for the same. Please find the attached pdf
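
For reference, the "remove lines" step named in the quoted reply could be sketched roughly as below. This is only an illustration in plain Python, not the method used to produce the image in the thread: it assumes the page has already been binarized into rows of 0/1 values (1 = ink), and the function name `remove_long_runs` and the `min_len` threshold are hypothetical. The idea is that box borders form much longer straight runs of ink than any digit stroke, so runs at least `min_len` pixels long are erased in both directions. The same loops should port to C# over a 2D array without much change.

```python
def remove_long_runs(img, min_len):
    """Return a copy of a binary image (list of rows, 1 = ink, 0 = background)
    with every horizontal or vertical run of ink of at least min_len pixels erased.

    Assumption: min_len is chosen larger than the tallest and widest digit,
    so only box lines qualify as "long" runs.
    """
    h = len(img)
    w = len(img[0]) if h else 0
    erase = [[False] * w for _ in range(h)]

    # Mark long horizontal runs, row by row.
    for y in range(h):
        x = 0
        while x < w:
            if img[y][x]:
                start = x
                while x < w and img[y][x]:
                    x += 1
                if x - start >= min_len:
                    for i in range(start, x):
                        erase[y][i] = True
            else:
                x += 1

    # Mark long vertical runs, column by column.
    for x in range(w):
        y = 0
        while y < h:
            if img[y][x]:
                start = y
                while y < h and img[y][x]:
                    y += 1
                if y - start >= min_len:
                    for i in range(start, y):
                        erase[i][x] = True
            else:
                y += 1

    # Build the result without modifying the input image.
    return [[0 if erase[y][x] else img[y][x] for x in range(w)]
            for y in range(h)]
```

Runs are measured on the original image, so a digit stroke that touches a box line gets counted together with that line's thickness; the threshold needs some margin for that. Pixels where a digit crosses a line are erased along with the line, which may leave small gaps in the digit.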

