Yes, I think the text size (x-height) was too small. Also, the English language data may have been trained on more fonts, given that Google created it. --Sven
On Thu, Nov 15, 2012 at 6:43 AM, sascha4j <[email protected]> wrote:
> After converting the image with ImageMagick the result is better. Not 100%,
> but nearly.
>
> The options for ImageMagick were
>
> convert -colorspace gray -resize 200% -unsharp 0x8+1.5+0.05
>
>
> On Thursday, 15 November 2012 10:26:21 UTC+1, sascha4j wrote:
>
>> Hi,
>>
>> I am trying to OCR some scanned text with tesseract-ocr.
>>
>> For some images the result is quite good.
>>
>> But for this one (see attached file) the result is poor.
>>
>> Any hints why? And what could I do to get a better result?
>>
>> I use tesseract 3.0.2 with the German language.
>>
>> Greetings
>> sascha4j
>>
>>
> --
> You received this message because you are subscribed to the Google
> Groups "tesseract-ocr" group.
> To post to this group, send email to [email protected]
> To unsubscribe from this group, send email to
> [email protected]
> For more options, visit this group at
> http://groups.google.com/group/tesseract-ocr?hl=en
>

--
"All that is gold does not glitter,
not all those who wander are lost;
the old that is strong does not wither,
deep roots are not reached by the frost.
From the ashes a fire shall be woken,
a light from the shadows shall spring;
renewed shall be blade that was broken,
the crownless again shall be king."
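For anyone who wants to script the preprocessing step quoted above, here is a minimal Python sketch. The ImageMagick options are the ones given in the thread. Everything else is an assumption: the file names, the helper function names, the placement of the options after the input file, and the use of "deu" as the German language code. It also assumes the `convert` and `tesseract` commands are on PATH (ImageMagick 7 installs the tool as `magick` instead).

```python
import subprocess
from pathlib import Path

# Options quoted in the thread. The input and output file names were not
# given there, so the paths used below are placeholders.
CONVERT_OPTIONS = ["-colorspace", "gray", "-resize", "200%",
                   "-unsharp", "0x8+1.5+0.05"]


def build_convert_command(src, dst):
    """Return the ImageMagick argument list: convert SRC <options> DST."""
    # Assumption: the options go after the input file, so that they are
    # applied to the image once it has been read.
    return ["convert", str(src), *CONVERT_OPTIONS, str(dst)]


def build_tesseract_command(image, output_base, lang="deu"):
    """Return the Tesseract argument list; text is written to OUTPUT_BASE.txt."""
    return ["tesseract", str(image), str(output_base), "-l", lang]


def preprocess_and_ocr(src, workdir="."):
    """Run both steps in order and return the path of the text file."""
    src = Path(src)
    cleaned = Path(workdir) / (src.stem + "_clean.png")
    output_base = Path(workdir) / src.stem
    subprocess.run(build_convert_command(src, cleaned), check=True)
    subprocess.run(build_tesseract_command(cleaned, output_base), check=True)
    return Path(str(output_base) + ".txt")


if __name__ == "__main__":
    # Only print the commands here; call preprocess_and_ocr() to run them.
    print(" ".join(build_convert_command("scan.png", "scan_clean.png")))
    print(" ".join(build_tesseract_command("scan_clean.png", "scan")))
```

The 200% resize is the part that addresses the small x-height. The factor that works best will depend on the scan resolution, so treat it as a starting point rather than a fixed value.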

