Hi Romeo, I can only answer a few of your questions, but I'll do that.
> 2) How do I preprocess the image? I was a bit surprised to find out that > converting the image to grayscale before passing it to Tesseract results in > different (and generally better) accuracy. Why? Also inverting the image works > better for some text. What are the set of important transformations to play > with? Tesseract internally converts down to greyscale. I'd guess you saw better results when you did that before passing to Tesseract as the light/dark levels were better. More generally, try to preprocess as much as possible before passing to Tesseract. It sounds like you'd like to use Tesseract to determine where text is in a busy image, which is going to be tricky. You may well be able to do something like this using either the API or hOCR output + script, by only accepting letters which have high confidence (and hence ignoring speculative and incorrect things based on noise elsewhere in the image). It sounds like you're trying to move in that direction, which is interesting, and I look forward to hearing more of how you get on. Apologies I couldn't answer more of your questions. Hopefully someone else on the list will be able to. Nick -- -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To post to this group, send email to [email protected] To unsubscribe from this group, send email to [email protected] For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en --- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. For more options, visit https://groups.google.com/groups/opt_out.

