Yes, using the 'digits' config file is exactly the same as using "-c tessedit_char_whitelist=0123456789.-" - if you look in the digits config file (tessdata/configs/digits) then you'll see that's all it does.
On Wed, Mar 26, 2014 at 12:39:33AM -0700, Faruk Terzioğlu wrote: > Hi Nick, > > "-c tessedit_char_whitelist=0123456789" is a good solution, I didn't know > changing configuration like that. Thank you for this technic. > > But calling "digits" config file like Neil did, also tighten the whitelist. > > 25 Mart 2014 Salı 18:29:21 UTC+2 tarihinde Nick White yazdı: > > Hi Neil, > > Faruk's advice is good, and additionally you could tighten the > digits whitelist to only include numbers (digits also includes - and > .), presuming you're sure that's all you'll see. > > You could do that by using a command like this: > > tesseract IMG_20140325_100806.jpg test1 -c tessedit_char_whitelist= > 0123456789 > > Tesseract is designed for printed pages, though, so it won't do well > for very skewed or crumpled images. If you can fix them before > passing to Tesseract, though, then it should work fine. > > Nick > -- -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To post to this group, send email to [email protected] To unsubscribe from this group, send email to [email protected] For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en --- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. For more options, visit https://groups.google.com/d/optout.

