Hello,

I am trying tesseract 4 for a project to read digits on different surfaces. 

currently I invoke tesseract with the following options :
--oem 1 -l eng -c tessedit_char_whitelist=0123456789 digits

when using the "best" testdata 
(from https://github.com/tesseract-ocr/tessdata/tree/master/best ) it only 
works in oem 1 mode and returns all kinds of characters, it completely 
ignores the char whitelist and/or the digits directive

I am wondering how to get the best out of tesseract for digits only ?

How to get the so called "best" traineddata in digits only mode ?

Is there somewhere some traineddata focusing on digits only (from different 
fonts ?) ?

Is my setup with tesseract4 and the options above the best way to run on 
digits ?

Thank you very much for the help.

Declic

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To post to this group, send email to tesseract-ocr@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/d80c9cb7-ffc2-4f3d-98c7-3402cb6451df%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Reply via email to