I'm working with Tesseract 3.02, trying to perform an OCR on an original 
source image (not scanned, so there is no noise or other artifacts).  The 
image contains the text "6582044/1", but it is detecting "6582044I1".

I've tried setting tessedit_char_whitelist to "/0123456789" (because in 
this case I know my input text will only contains digits and slashes), but 
the result is then "658204411".

Is there anything I should try, to improve the accuracy in distinguishing 
forward slashes?

<https://lh3.googleusercontent.com/-z7KO2G9QVis/WJyVVatHc0I/AAAAAAAAGBg/wisWnP1YCewoP0YMOc78Xeo27_m9V2LOgCLcB/s1600/ocrinput_bw_works.png>


Thanks,
Paul

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To post to this group, send email to [email protected].
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/474d12c4-1bd7-4c65-b882-f4329066c758%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Reply via email to