I'm working with Tesseract 3.02, trying to perform OCR on an original source image (not scanned, so there is no noise or other artifacts). The image contains the text "6582044/1", but Tesseract is detecting "6582044I1".
I've tried setting tessedit_char_whitelist to "/0123456789" (because in this case I know my input text will only contain digits and slashes), but the result is then "658204411". Is there anything I should try to improve the accuracy in distinguishing forward slashes?

<https://lh3.googleusercontent.com/-z7KO2G9QVis/WJyVVatHc0I/AAAAAAAAGBg/wisWnP1YCewoP0YMOc78Xeo27_m9V2LOgCLcB/s1600/ocrinput_bw_works.png>

Thanks,
Paul

