[tesseract-ocr] Re: Latin language

2014-11-24 Thread Ryan Baumann
Pull requests or patches are more than welcome, as I'm just getting familiar with the Tesseract training process myself. I've just pushed a few changes to get possibly-better output for the training_text and word/frequency files, but incorporating Latin-specific changes for unicharambigs may

[tesseract-ocr] How can I get candidate for each letter

2014-11-24 Thread yx wang
Dear all, I am trying to do some improvement for the text recognized by Tesseract OCR. For some low quality picture,some letters may be mis-recognized with some similar letters, So I want to get the candidate letter for them to do some improvement, I have look through the source ode