(I permit myself to pick this topic up, again, after a break of a few months during which I had other obligations.)
My install is Debian, by now 5.0. I run tesseract out of the box. It works pretty well, except that - under 4.0 and now under 5.0 - all lowercase 'e' are recognised as lowercase 'c', irrespective of resolution or font size. Any optical inspection reveals the clear predominance of the horizontal stroke in the 'e'-s. Like before, I can't make out how to attach an image file that fails for us. I wonder, if anybody out there could please help me, to identify the setting in one of those configuration files so that it starts to recognize the lowercase 'e'-s properly. Maybe I should add that we don't feed it with any specific language/ dictionary. The character to be recognised here, are just supposed to be recognised as such. We only need tesseract to recognize the standard ASCII-128 characters. Thanks in advance, Uwe --~--~---------~--~----~------------~-------~--~----~ You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To post to this group, send email to [email protected] To unsubscribe from this group, send email to [email protected] For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en -~----------~----~----~----~------~----~------~--~---

