Your image is too low resolution, PNG format is actually great. You also
might convert to b/w rather than color. Shoot for about 200-300dpi
(technically pixels per inch). You can get more info on the FAQ --
technically tesseract looks at the font height and evaluates a sort of
vector.
--Sven


On Fri, Feb 22, 2013 at 8:25 PM, Colin Williams <[email protected]>wrote:

> The output from tesseract was so poor on my first attempt with a png, that
> I thought something was wrong. I read somewhere to use tiff. Anyhow I tried
> again, and this time there is some resemblance to the given text (It's
> still crap). Is this the kind of output I should expect from tesseract?
>
>
> On Thursday, February 21, 2013 3:50:41 PM UTC-8, Colin Williams wrote:
>>
>> Hi,
>>
>> I'm trying to capture a screenshot then run OCR on that screenshot. I've
>> tried:
>>
>> > import -depth 24 ss.tiff
>>
>> I also tried
>>
>> > import -depth 24 ss.tiff
>> > convert -alpha Off ss.tiff ssoff.tiff
>>
>> Either way I get
>>
>> tesseract ssoff.tiff output
>> Tesseract Open Source OCR Engine v3.02.02 with Leptonica
>> Error in pixReadFromTiffStream: can't handle bpp > 32
>> Error in pixReadStreamTiff: pix not read
>> Error in pixReadStream: tiff: no pix returned
>> Error in pixRead: pix not read
>> Unsupported image type.
>>
>>
>> So how do I go about creating a screenshot that tesseract can read with
>> the imagemagick command line tools?
>>
>  --
> --
> You received this message because you are subscribed to the Google
> Groups "tesseract-ocr" group.
> To post to this group, send email to [email protected]
> To unsubscribe from this group, send email to
> [email protected]
> For more options, visit this group at
> http://groups.google.com/group/tesseract-ocr?hl=en
>
> ---
> You received this message because you are subscribed to the Google Groups
> "tesseract-ocr" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to [email protected].
> For more options, visit https://groups.google.com/groups/opt_out.
>
>
>



-- 
``All that is gold does not glitter,
  not all those who wander are lost;
the old that is strong does not wither,
  deep roots are not reached by the frost.
>From the ashes a fire shall be woken,
  a light from the shadows shall spring;
renewed shall be blade that was broken,
  the crownless again shall be king.”

-- 
-- 
You received this message because you are subscribed to the Google
Groups "tesseract-ocr" group.
To post to this group, send email to [email protected]
To unsubscribe from this group, send email to
[email protected]
For more options, visit this group at
http://groups.google.com/group/tesseract-ocr?hl=en

--- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
For more options, visit https://groups.google.com/groups/opt_out.


Reply via email to