Re: [tesseract-ocr] Trained font - always one letter wrong

2018-05-01 Thread dave . hardy
Training doesn't work. If i use the characters "ä, ö, ü" (which i need) in my training text, text2image says: WARNING: illegal UTF8 encountered and then creates an incorrect box/tif pair. This seems not to depend on my font, because with Arial it does the same thing. Can you help me to avoid

Re: [tesseract-ocr] Trained font - always one letter wrong

2018-05-02 Thread dave . hardy
Thanks for your effort! I tried language deu before and as you can see in your attached txts, there are some faults too. I could not eliminate them using freq- or user-words. But in general your result with deu is much better Than mine with v 3.05. -- You received this message because you are

Re: [tesseract-ocr] Trained font - always one letter wrong

2018-04-29 Thread dave . hardy
I did. Unfortunately they don't aswer... Have you any advice for me, to improve the training proccess? How many training texts should i use? Or is it possible that there is a problem with this font at all? Would help very much to find that out. Best regards Dave -- You received this message

[tesseract-ocr] Trained font - always one letter wrong

2018-04-25 Thread dave . hardy
Hello there, i don't know what to do anymore... I want to use tesseract-ocr 3.05 for scanning documents, using the font "Perfect DOS VGA 437 Win". Got a traineddata file for my font from trainyourtesseract.com, actual it works really nice but in every case the letter "d" isnt identified but