Hello,

I tested Tesseract and found that accuracy increases considerably when a frequent-words list is added to the traineddata in freq-dawg form. Results that were originally wrong are corrected to match a word from the list. (It's magic!!)

Now I want to know: once a word has been recognized, how does Tesseract decide whether the result is in the frequent-words list? (Which part of the code should I study?)

Tesseract appears to fix the result when the original result is similar to a word in the frequent-words list. When that happens, how can I get the similarity degree? (The similarity between the recognized word and the word from the list must be high enough for Tesseract to make that decision.) What I want is to get the similarity degree (or matching rate?) out of the recognition process; I am willing to modify the source code to output it.

Thanks in advance. (Sorry for my poor English; if my question is unclear, please let me know ^^)
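To illustrate the kind of number I am after, here is a minimal sketch in Python. It is only my assumption of what a "similarity degree" could look like: a normalized edit distance computed outside Tesseract, between the result obtained without the word list and the result obtained with it. The function names `levenshtein` and `similarity` are my own and are not part of Tesseract, and I do not know whether Tesseract uses anything like this measure internally.

```python
def levenshtein(a: str, b: str) -> int:
    """Return the edit distance (insert, delete, substitute) between a and b."""
    # Keep the shorter string in the inner loop so each row stays small.
    if len(a) < len(b):
        a, b = b, a
    previous = list(range(len(b) + 1))  # distances from "" to prefixes of b
    for i, ca in enumerate(a, start=1):
        current = [i]  # distance from a[:i] to ""
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # substitute ca with cb (or match)
            ))
        previous = current
    return previous[-1]


def similarity(raw: str, corrected: str) -> float:
    """Hypothetical similarity degree in [0, 1]; 1.0 means identical strings."""
    longest = max(len(raw), len(corrected))
    if longest == 0:
        return 1.0  # two empty strings are treated as identical
    return 1.0 - levenshtein(raw, corrected) / longest


if __name__ == "__main__":
    # Example: the raw result "he1lo" was fixed to the list word "hello".
    print(similarity("he1lo", "hello"))
```

For "he1lo" and "hello" the edit distance is 1 over 5 characters, so the score is 0.8. What I would really like is the equivalent value that Tesseract itself uses when it makes the decision.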

