Hi, Just a quick reply: I tried it on Windows XP with tesseract 3.00 and it produced bad result (nothing usefull).
InfranView informations dialog showed that image has resolution 72x72 DPI -> to low... So I resampled it (with Lanczos algorithm) from 100% to 300% size, set DPI to 300 and decreased number of color to 16 (in InfranView because I have no time to play with ImageMagick's options ;-) )... Than OCR result was much more better with several mistakes (just quick check)... So with several image improvements you can get good OCR result. BR, Zd. On Fri, Feb 18, 2011 at 3:53 PM, Bob Kuo <[email protected]> wrote: > Hello all > > Please forgive the newbie question. I've seen this posted several > times before, and I thought I had the right solution but apparently > not. Attached is a PNG that I'd like to run through tesseract. I > used ImageMagick's convert to change it into a tiff: > > convert -density 200 -units PixelsPerInch test_page.png -type > Grayscale +compress test_input.tif > > (I've also tried to do this at -density 300 with the same results) > > The resulting TIF is attached. When I run it through tesseract I get > an output file that is one byte and is basically blank. Command and > output below. > > tesseract test_input.tif output -l eng > Tesseract Open Source OCR Engine > Image has 8 * 1 bits per pixel, and size (375,350) > Resolution=200 > > I saw some other threads about a similar problem, but the solutions > were to scale it to 200 or 300 DPI, make sure it was in grayscale, > remove the alpha layer, and somewhere else it said it was fixed in > Tesseract 2.04. I'm using Tesseract 2.04 on Mac OS X 10.6.6 and > ImageMagick 6.6.7-1. Is my image just unsuitable for OCR-ing? > > I appreciate any help. > > Thanks, > > Bob > > -- > You received this message because you are subscribed to the Google Groups > "tesseract-ocr" group. > To post to this group, send email to [email protected]. > To unsubscribe from this group, send email to > [email protected]. > For more options, visit this group at > http://groups.google.com/group/tesseract-ocr?hl=en. > > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To post to this group, send email to [email protected]. To unsubscribe from this group, send email to [email protected]. For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en.

