[tesseract-ocr] Distinguishing between "1" and "/"

Paul Grebenc Thu, 09 Feb 2017 08:38:01 -0800

I'm working with Tesseract 3.02, trying to perform OCR on an original 
source image (not scanned, so there is no noise or other artifacts).  The 
image contains the text "6582044/1", but Tesseract detects "6582044I1".


I've tried setting tessedit_char_whitelist to "/0123456789" (because in 
this case I know my input text will only contain digits and slashes), but 
the result is then "658204411".
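
One way to supply such a whitelist is through a Tesseract config file passed 
as the last command-line argument. The following is only a minimal sketch 
under stated assumptions: the helper names, the config file name 
"digits_slash", the output base "result", and a local copy of the image named 
"ocrinput_bw_works.png" are all hypothetical, and the tesseract binary is 
assumed to be on the PATH. The sketch only builds the command; it does not 
run it.

```python
import os
import tempfile


def write_whitelist_config(directory, allowed_chars, name="digits_slash"):
    """Write a config file that restricts recognition to allowed_chars.

    The file name "digits_slash" is a hypothetical choice for this sketch.
    """
    path = os.path.join(directory, name)
    with open(path, "w") as handle:
        # Tesseract config files hold one "variable value" pair per line.
        handle.write("tessedit_char_whitelist " + allowed_chars + "\n")
    return path


def build_command(image_path, output_base, config_path):
    """Return the argument list for: tesseract <image> <outputbase> <configfile>."""
    return ["tesseract", image_path, output_base, config_path]


config_dir = tempfile.mkdtemp()
config_path = write_whitelist_config(config_dir, "/0123456789")
# "ocrinput_bw_works.png" and "result" are assumed local names.
command = build_command("ocrinput_bw_works.png", "result", config_path)
print(" ".join(command))

# To actually run it (requires Tesseract to be installed):
#   import subprocess
#   subprocess.check_call(command)
```

Passing the variable this way should be equivalent to setting it through the 
API, so on its own it would not be expected to change the "658204411" result.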

Is there anything I should try to improve the accuracy in distinguishing 
forward slashes from the digit "1"?

<https://lh3.googleusercontent.com/-z7KO2G9QVis/WJyVVatHc0I/AAAAAAAAGBg/wisWnP1YCewoP0YMOc78Xeo27_m9V2LOgCLcB/s1600/ocrinput_bw_works.png>


Thanks,
Paul
