Maybe a few "select by color" loops to pull the numbers out on their own? I have done this kind of thing with gimp scripts but it could be done programmatically. This might be a case where you can make the colors work for you.
art From: [email protected] [mailto:[email protected]] On Behalf Of Aaron G Sent: Tuesday, April 21, 2015 8:34 PM To: [email protected] Subject: [tesseract-ocr] OCR From Small Graphs Hello - First, thank you to everyone supporting this tool... I've had pretty good success with it in the past, but am running into an issue, I'm hoping someone may be able to help with. I receive images similar to the attached, and based on what I have read, it sounds like tesseract may have issues with this type of test, but I'm wondering if anyone has any thoughts on the best way to pull the numbers out. I've tried converting to grayscale, increasing density, etc... but have not had any success. I'm running tesseract 3.02 on Ubuntu 12.04. Any help would be greatly appreciated. Thanks, Aaron -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]<mailto:[email protected]>. To post to this group, send email to [email protected]<mailto:[email protected]>. Visit this group at http://groups.google.com/group/tesseract-ocr. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/74c2f8d4-ace8-4993-be57-d2c72550e013%40googlegroups.com<https://groups.google.com/d/msgid/tesseract-ocr/74c2f8d4-ace8-4993-be57-d2c72550e013%40googlegroups.com?utm_medium=email&utm_source=footer>. For more options, visit https://groups.google.com/d/optout. -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To post to this group, send email to [email protected]. Visit this group at http://groups.google.com/group/tesseract-ocr. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/BY2PR11MB0743FF737A679BF1DEAA8D31DCEE0%40BY2PR11MB0743.namprd11.prod.outlook.com. For more options, visit https://groups.google.com/d/optout.

