if you only care about the four dark, printed lines you might be able to get by with a simple threshold since the characters are so much darker than the pattern. you can also use a sauvola threshold which very intelligently corrects for regional differences in luminosity in the image.
max On Jun 21, 2011, at 10:24 PM, Felipe Leal Coutinho wrote: > Hello, > > I'm try to use tesseract to make OCR of bank cheques captured from > digital cameras. As you can see (http://dl.dropbox.com/u/24085540/ > cheque-exemplo.jpg), these documents have a black text with a color > background (there isn't black color at the background). In order to > improve the results, I think that I will need to make some pre- > processing. Do you suggest something? I was thinking in remove the > background, but I didn't found any method to do that. > > Regards, > > Felipe. > > -- > You received this message because you are subscribed to the Google > Groups "tesseract-ocr" group. > To post to this group, send email to [email protected] > To unsubscribe from this group, send email to > [email protected] > For more options, visit this group at > http://groups.google.com/group/tesseract-ocr?hl=en -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To post to this group, send email to [email protected] To unsubscribe from this group, send email to [email protected] For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en

