I am using Tesseract v4.0 beta 1 and trying to OCR a remittance file. One 
section contains the text CHECK NO, but Tesseract doesn't seem to recognize 
it at all.

I have tried supplying a user-words dictionary and setting the penalties for 
non-dictionary words to 1, but the output does not change.

tesseract capture.png captureoutput1 --user-words "C:\Program Files (x86)\Tesseract-OCR\tessdata\eng.user-words" -c load_system_dawg=0 -c load_freq_dawg=0 -c language_model_penalty_non_dict_word=1 -c language_model_penalty_non_freq_dict_word=1
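
In case it helps anyone reproduce this, here is a minimal Python sketch of the same invocation. It assumes tesseract is on the PATH; the helper names build_command and run_ocr are made up for illustration and are not part of Tesseract. Passing the arguments as a list avoids quoting problems with the space in "Program Files".

```python
import subprocess
from pathlib import Path

# Path copied from the command above; adjust it for your installation.
USER_WORDS = Path(r"C:\Program Files (x86)\Tesseract-OCR\tessdata\eng.user-words")


def build_command(image, output_base, user_words=USER_WORDS):
    """Return the argument list equivalent to the command line above."""
    # Same -c settings as in the original command, nothing added.
    config = {
        "load_system_dawg": "0",
        "load_freq_dawg": "0",
        "language_model_penalty_non_dict_word": "1",
        "language_model_penalty_non_freq_dict_word": "1",
    }
    cmd = ["tesseract", str(image), str(output_base),
           "--user-words", str(user_words)]
    for key, value in config.items():
        cmd += ["-c", f"{key}={value}"]
    return cmd


def run_ocr(image, output_base):
    """Hypothetical helper: run tesseract and return the recognized text."""
    subprocess.run(build_command(image, output_base), check=True)
    # tesseract appends ".txt" to the output base name.
    return Path(f"{output_base}.txt").read_text(encoding="utf-8")
```

Calling run_ocr("capture.png", "captureoutput1") should give the same text as the command line run.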

These are the words I have in eng.user-words.

CHECK NO.
CHECK
NO
check
no

Any idea how to fix this?

Thanks,
Hari

cueck wo. 150744

