Your image is too low resolution; the PNG format itself is actually great. You might also convert it to black and white rather than color. Aim for about 200-300 dpi (technically pixels per inch). There is more information in the FAQ; technically, tesseract looks at the font height and evaluates a sort of vector. --Sven
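As a rough sketch of the advice above using the ImageMagick tools already mentioned in this thread: the helper names `scale_percent` and `prepare_for_ocr` are made up for illustration, and the 96 pixels-per-inch screen density is an assumption you should adjust for your display. `-depth 8` is used because ImageMagick's `-depth` counts bits per channel, which may be why `-depth 24` produced the "bpp > 32" error, though that is a guess rather than a confirmed diagnosis.

```shell
#!/bin/sh
# Assumption: the screenshot was captured at roughly 96 pixels per inch.
SCREEN_DPI=96
# Target density, taken from the upper end of the 200-300 dpi advice.
TARGET_DPI=300

# Print the enlargement percentage needed to go from density $1 to density $2,
# rounded to the nearest whole percent using integer arithmetic.
scale_percent() {
    echo $(( ($2 * 100 + $1 / 2) / $1 ))
}

# Turn screenshot $1 into an 8-bit grayscale image $2, enlarged for OCR.
# Uses only standard ImageMagick options: alpha channel off, gray colorspace,
# 8 bits per channel, percentage resize, and density metadata in pixels per inch.
prepare_for_ocr() {
    in=$1
    out=$2
    pct=$(scale_percent "$SCREEN_DPI" "$TARGET_DPI")
    convert "$in" -alpha off -colorspace Gray -depth 8 \
        -resize "${pct}%" -units PixelsPerInch -density "$TARGET_DPI" "$out"
}
```

After capturing with `import ss.png`, you could run `prepare_for_ocr ss.png ss_ocr.png` and then `tesseract ss_ocr.png output`. Whether simple upscaling is enough depends on the screenshot; small or anti-aliased screen fonts may still need experimenting with the target density.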
On Fri, Feb 22, 2013 at 8:25 PM, Colin Williams <[email protected]> wrote:

> The output from tesseract was so poor on my first attempt with a png, that
> I thought something was wrong. I read somewhere to use tiff. Anyhow I tried
> again, and this time there is some resemblance to the given text (It's
> still crap). Is this the kind of output I should expect from tesseract?
>
>
> On Thursday, February 21, 2013 3:50:41 PM UTC-8, Colin Williams wrote:
>>
>> Hi,
>>
>> I'm trying to capture a screenshot then run OCR on that screenshot. I've
>> tried:
>>
>> > import -depth 24 ss.tiff
>>
>> I also tried
>>
>> > import -depth 24 ss.tiff
>> > convert -alpha Off ss.tiff ssoff.tiff
>>
>> Either way I get
>>
>> tesseract ssoff.tiff output
>> Tesseract Open Source OCR Engine v3.02.02 with Leptonica
>> Error in pixReadFromTiffStream: can't handle bpp > 32
>> Error in pixReadStreamTiff: pix not read
>> Error in pixReadStream: tiff: no pix returned
>> Error in pixRead: pix not read
>> Unsupported image type.
>>
>>
>> So how do I go about creating a screenshot that tesseract can read with
>> the imagemagick command line tools?
>>
> --
> --
> You received this message because you are subscribed to the Google
> Groups "tesseract-ocr" group.
> To post to this group, send email to [email protected]
> To unsubscribe from this group, send email to
> [email protected]
> For more options, visit this group at
> http://groups.google.com/group/tesseract-ocr?hl=en
>
> ---
> You received this message because you are subscribed to the Google Groups
> "tesseract-ocr" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to [email protected].
> For more options, visit https://groups.google.com/groups/opt_out.
>
>

--
``All that is gold does not glitter, not all those who wander are lost; the old that is strong does not wither, deep roots are not reached by the frost.
From the ashes a fire shall be woken, a light from the shadows shall spring; renewed shall be blade that was broken, the crownless again shall be king.”

