Hey! Thanks for your suggestion Can you just give me a brief outline of the general pre processing step if I have this kind of images.
On Thursday, June 15, 2017 at 12:17:24 AM UTC+5:30, Andres wrote: > > The things that you mentione are not just details, they are subject of > wide study and specialization. > > From all the books from computer vision / image processing that I have, > the best one that addresses your enquiries is "Algorithms for Image > Processing and Computer Vision", from J.R. Parker, 2nd edition. ISBN > 978-0-470-64385-3 > > Cheers, > > Andres > > > El domingo, 11 de junio de 2017, 16:06:43 (UTC-3), [email protected] > escribió: >> >> I am trying to do do OCR using tesseract on images. I am unable to figure >> out a proper pre processing technique for the same. >> >> the problems I am facing is: >> >> 1. Low contrast images: The images have different texts with different >> font sizes. So what should be my approach to enhance the contrast of any >> image. >> >> 2. Problem of touching characters: Sometimes after applying adaptive >> thresholding I am facing the problem of touching characters (in which two >> adjacent characters are touching each other) What is the best way to figure >> out a solution for that. >> >> 3. Problem of non uniform illumination: How should I proceed if I want to >> solve the problem of non uniform illumination ? >> >> How can image segmentation solve my problem ? >> >> I have added a sample image. Assume that the image is not rotated as it >> is there in the picture. But the variety of font sizes and the text >> segments in the image are exact replica of what I am asking about ? Apart >> from above mentioned steps, I would appreciate any kind of suggestion for >> pre - processing of the above image. Let me know if you have worked out a >> solution for something related to this. >> >> Thanks >> >> >> -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To post to this group, send email to [email protected]. Visit this group at https://groups.google.com/group/tesseract-ocr. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/78fe75f2-c0bb-4bb7-9dab-ceb6a68a5b30%40googlegroups.com. For more options, visit https://groups.google.com/d/optout.

