If you're able to use OpenCV then, given a list of contours or blobs, you should be able to reconstruct a binary image from them and feed that to Tesseract. This is only a general idea. For more practical advice, send us your sample image(s).
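
To make the idea concrete, here is a minimal sketch and not a definitive implementation. It assumes each blob is already available as a list of (x, y) pixel coordinates, for example from a connected-component labelling step. The function names, the sample blobs and the image size are hypothetical. If you have contours rather than pixel lists, OpenCV's `cv2.drawContours` with `thickness=cv2.FILLED` on a blank image should give the equivalent fill. The sketch uses only the Python standard library and serialises the result as plain PBM, where 1 means black, so the text pixels come out dark on a light background.

```python
def blobs_to_binary(blobs, width, height):
    """Rasterise blobs (lists of (x, y) pixels) into rows of 0/1, 1 = text pixel."""
    image = [[0] * width for _ in range(height)]
    for blob in blobs:
        for x, y in blob:
            # Skip coordinates outside the canvas instead of raising.
            if 0 <= x < width and 0 <= y < height:
                image[y][x] = 1
    return image


def to_pbm(image):
    """Serialise a 0/1 image as plain PBM (magic number P1, 1 = black)."""
    height = len(image)
    width = len(image[0]) if height else 0
    rows = [" ".join(str(v) for v in row) for row in image]
    return "P1\n{} {}\n{}\n".format(width, height, "\n".join(rows))


# Hypothetical blobs: two small components on a 6x3 canvas.
blobs = [[(1, 1), (2, 1), (1, 2)], [(4, 0), (4, 1)]]
binary = blobs_to_binary(blobs, width=6, height=3)
pbm_text = to_pbm(binary)
```

Writing `pbm_text` to a file gives you a binary image to try with Tesseract. Whether your build reads PBM depends on its image library, so saving as PNG or TIFF from OpenCV may be the safer route.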

Warm regards,
Dmitri Silaev
www.CustomOCR.com

On Sat, Nov 12, 2011 at 4:37 PM, daniel <[email protected]> wrote:
> Hi,
>
> I want to use tesseract to read text off things like posters and
> packages. The text will have different colors, there will be images
> and other mess, so it seems like a non-standard situation. I thought
> it would help if I use some opencv segmentation or contour finding
> algorithm instead of the thresholding that tesseract seems to do.
> That, however, will not provide a binary image, but a list of
> components/contours. How can I feed this to tesseract?
>
> Best
>
> Daniel
>
> --
> You received this message because you are subscribed to the Google
> Groups "tesseract-ocr" group.
> To post to this group, send email to [email protected]
> To unsubscribe from this group, send email to
> [email protected]
> For more options, visit this group at
> http://groups.google.com/group/tesseract-ocr?hl=en

