Re: ocr of image fails

Sven Pedersen Thu, 15 Nov 2012 07:33:33 -0800

Yes, I think the text size (x-height) was too small. Also, the English
language data may be trained with more fonts, given that Google created it.
--Sven



On Thu, Nov 15, 2012 at 6:43 AM, sascha4j <[email protected]> wrote:

> after converting the image with imagmagick the result is better. not 100%
> but nearly.
>
> the options for imagemagick were
>
> convert -colorspace gray -resize 200% -unsharp 0x8+1.5+0.05
>
>
> Am Donnerstag, 15. November 2012 10:26:21 UTC+1 schrieb sascha4j:
>
>> Hi,
>>
>> i try to ocr some scanned text with tesseract-ocr.
>>
>> for some images the result is quite good.
>>
>> but for this one  ( see attached file) the result is poor.
>>
>> any hints why ? and what i could do to get a better result?
>>
>> i use tesseract 3.0.2 with german language.
>>
>> greetings
>> sascha4j
>>
>>
>  --
> You received this message because you are subscribed to the Google
> Groups "tesseract-ocr" group.
> To post to this group, send email to [email protected]
> To unsubscribe from this group, send email to
> [email protected]
> For more options, visit this group at
> http://groups.google.com/group/tesseract-ocr?hl=en
>



-- 
``All that is gold does not glitter,
  not all those who wander are lost;
the old that is strong does not wither,
  deep roots are not reached by the frost.
>From the ashes a fire shall be woken,
  a light from the shadows shall spring;
renewed shall be blade that was broken,
  the crownless again shall be king.”

-- 
You received this message because you are subscribed to the Google
Groups "tesseract-ocr" group.
To post to this group, send email to [email protected]
To unsubscribe from this group, send email to
[email protected]
For more options, visit this group at
http://groups.google.com/group/tesseract-ocr?hl=en

Re: ocr of image fails

Reply via email to