If I duplicate the column 9 times, so that there's ten columns with the same numbers, it reads it correctly. Running these results through the training tools didn't help it recognize the original image, though. Running tesseract on images with a single digit yielded nothing as well.
In my program, do I have to programatically duplicate my column of numbers several times and then figure out what the result was supposed to be... or can I train tesseract to recognize a single column? I suppose duplicating it will work, but it seems like a bad hack. On Mon, Feb 13, 2012 at 10:42 AM, Chris <[email protected]> wrote: > I'd try segmenting the numbers out yourself and feeding them into > tesseract as individual characters. Might work better than feeding it > the whole image. > > Make sure you put some padding around each character. > > On Feb 13, 1:56 am, JD <[email protected]> wrote: > > I'm using v 3.01 on Windows 7 to perform OCR on another program. I > > don't have access to the fonts the program is using, so I trained > > tesseract using some screenshots, and so far the text recognition is > > far better than I expected. However, when I try to process a > > screenshot that contains only a few numbers, it doesn't match anything > > at all. If was matching garbage, or the wrong numbers, then I'd just > > keep working on improving the training... but it doesn't find > > anything. Does anyone have a suggestion about what I should try? > > > > It doesn't look like I can attach a screenshot, but the numbers are in > > a column... something like this: > > > > 10 > > 13 > > 14 > > 15 > > 17 > > > > I pre-process the screenshots so the text is black on white. I also > > zoom in on the images, so they're slightly blurred (only very > > slightly)... but the text recognition is near perfect, so I don't > > think that's an issue. Plus, it seems like it should find SOMETHING. > > -- > You received this message because you are subscribed to the Google > Groups "tesseract-ocr" group. > To post to this group, send email to [email protected] > To unsubscribe from this group, send email to > [email protected] > For more options, visit this group at > http://groups.google.com/group/tesseract-ocr?hl=en > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To post to this group, send email to [email protected] To unsubscribe from this group, send email to [email protected] For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en

