I don't recommend the blacklist/whitelist approach: if you force Tesseract to recognize only digits, it will turn many letters into digits. We use Tesseract 3.0 without a whitelist in our iPhone application, http://www.scanbizcards.com. There is a free version of the app if you want to see what Tesseract 3.0 can do on your sample images. One caveat: without the whitelist, Tesseract will often mistake some digits for letters or other similar-looking characters, e.g. '|' instead of '1'. We handle this with regular expressions that accept those look-alike characters in areas where we know to expect digits, then map them back to the digits they should be.
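To make the regex idea concrete, here is a minimal sketch in Python. It is not the code from the app: the look-alike table, the phone-number pattern, and the name fix_digits are all assumptions made up for illustration. The pattern accepts either a real digit or a known look-alike wherever a digit is expected, and only the matched span is mapped back to digits.

```python
import re

# Hypothetical look-alike table: characters OCR may emit in place of a digit.
LOOKALIKES = {"|": "1", "l": "1", "I": "1", "O": "0", "o": "0", "S": "5", "B": "8"}

# A "digit-like" character: a real digit or any look-alike from the table.
DIGITISH = "[0-9" + re.escape("".join(LOOKALIKES)) + "]"

# Hypothetical field where digits are expected: a 3-3-4 phone number.
PHONE = re.compile(rf"{DIGITISH}{{3}}[-. ]{DIGITISH}{{3}}[-. ]{DIGITISH}{{4}}")


def fix_digits(text):
    """Map look-alike characters back to digits, but only inside phone matches."""
    def repair(match):
        return "".join(LOOKALIKES.get(ch, ch) for ch in match.group(0))
    return PHONE.sub(repair, text)


print(fix_digits("Tel: 4|5-55S-O199 Bob"))
```

Text outside a match is left alone, so names such as "Bob" keep their letters. The tighter the pattern for each expected numeric field, the lower the risk of rewriting ordinary words.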
Patrick

On Jun 29, 7:44 am, maddy <[email protected]> wrote:
> Hi all,
>
> I am trying to use tesseract-2.04 in my iPhone application and just
> want to detect the numbers. What I am doing here is first I am cross
> compiling tesseract to generate lib file using this post
> http://robertcarlsen.net/2009/07/15/cross-compiling-for-iphone-dev-884
> and then using the demo application at
> http://robertcarlsen.net/2010/01/12/ocr-for-iphone-source-1080, but
> the results are far from realistic.
>
> I am not able to resolve the issue or how to train tesseract so that
> it comes closer to practical usage.
>
> Please help.
>
> Thanks,
>
> Madhup

