Hello. I am new in OCR, and now I got a problem.
I have got a program generated tiff image (white background without noise, and not so regular font type). The program generate this characters: ö ü , but lot's of time the tesseract recognize: o and u. And lots of time the o and u charaters recognized as ö and ü Maybe the tesseract think that the dot at the top of the char is noise, but it isn't. In the picture there is no noise at all! Is there any config parameter to change the heuristics? Thanks Barnabas -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To post to this group, send email to [email protected] To unsubscribe from this group, send email to [email protected] For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en

