Hi! Use a black and white image! I ran your image through this software (http://www.transym.com/download.htm), which uses the Tesseract OCR engine, and this was my result:
aaü.... AT&T 'F 4:25 PM 47% (...) R V K E E F D R O X l S

Bye,
ZS

On Jul 7, 1:58 am, coneybeare wrote:
> I have been messing around with Tesseract 3.00 for the past couple of
> days and have tried a few different approaches to training/image
> processing, none of which are really working. I am using the Pocket-OCR
> (https://github.com/rcarlsen/Pocket-OCR) app for iPhone to do
> the testing, but am training on OSX.
>
> A sample image that I need to scan is here: http://cloud.coneybeare.net/8FFB
> (I only need the bottom 12 tiles)
>
> Running this untrained on English obviously comes up as
> garbage: http://cloud.coneybeare.net/8Fbt
>
> I am just unclear as to what I need to do exactly to train Tesseract to
> detect these. I have gone through the training, made traineddata files
> and am familiar with the way to do it, but I must have my training
> strategy all wrong. I have tried many different training images
> (http://cloud.coneybeare.net/8FKj) but I just can't get results. Is
> it best to create a new font, with each "tile" representing a new
> letter? Or is it best to do some fancy image processing and cropping
> tesseract scan? Are there any optimizations I can do if I know I am
> only dealing with uppercase letters, and no words, numbers or
> punctuation? What should I do to reliably train Tesseract for
> detecting the tiles in this image?
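
For reference, "black and white" here means binarizing the image before handing it to the OCR engine. Below is a minimal sketch in plain Python, assuming the screenshot has already been loaded as rows of 0-255 grayscale values. Otsu's method is used to pick the threshold because it is a common choice, not because it is known to be what the software above does, and the function names are only illustrative.

```python
def otsu_threshold(pixels):
    """Pick a global threshold (0-255) from a flat list of grayscale values.

    Otsu's method: choose the cut that maximizes the between-class variance
    of the dark and light pixel groups.
    """
    hist = [0] * 256
    for p in pixels:
        hist[p] += 1

    total = len(pixels)
    sum_all = sum(level * count for level, count in enumerate(hist))

    best_t, best_var = 0, -1.0
    w_dark = 0      # number of pixels at or below the candidate threshold
    sum_dark = 0    # sum of their gray levels
    for t in range(256):
        w_dark += hist[t]
        sum_dark += t * hist[t]
        if w_dark == 0:
            continue            # no dark pixels yet
        w_light = total - w_dark
        if w_light == 0:
            break               # every pixel is already in the dark group
        mean_dark = sum_dark / w_dark
        mean_light = (sum_all - sum_dark) / w_light
        var_between = w_dark * w_light * (mean_dark - mean_light) ** 2
        if var_between > best_var:
            best_var, best_t = var_between, t
    return best_t


def binarize(image):
    """Return a copy of image (list of rows) with every pixel set to 0 or 255."""
    flat = [p for row in image for p in row]
    t = otsu_threshold(flat)
    return [[255 if p > t else 0 for p in row] for row in image]


if __name__ == "__main__":
    # Tiny made-up sample: dark background with a few bright pixels.
    sample = [[10, 12, 200],
              [11, 210, 205]]
    for row in binarize(sample):
        print(row)
```

The binarized rows can then be written back out with whatever image library is at hand and passed to the OCR step. Cropping to the 12 tiles first would keep the status bar out of the threshold calculation.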

