Again, processing the box data will help you identify such an occurrence.
You can then "filter" out the "hard" words and reprocess them separately.
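
As a rough illustration of that filtering step, here is a minimal Python sketch. It assumes the box data is in the usual box-file layout (one glyph per line: symbol, left, bottom, right, top, page) and that the "hard" text shows up as boxes noticeably taller than the rest. The function names and the 1.3 height ratio are made up for this example and would need tuning on real pages; this is not anything built into Tesseract.

```python
from statistics import median


def parse_box_lines(text):
    """Parse box-file text, one glyph per line: symbol left bottom right top page."""
    boxes = []
    for line in text.splitlines():
        parts = line.rsplit(" ", 5)
        if len(parts) != 6:
            continue  # skip blank or malformed lines
        try:
            left, bottom, right, top, page = map(int, parts[1:])
        except ValueError:
            continue  # skip lines whose coordinates are not integers
        boxes.append((parts[0], left, bottom, right, top, page))
    return boxes


def oversized_region(boxes, ratio=1.3):
    """Return (left, bottom, right, top) enclosing glyphs much taller than the median.

    `ratio` is a hypothetical threshold, not a Tesseract setting.
    Returns None when no glyph exceeds ratio * median height.
    """
    heights = [top - bottom for _, _, bottom, _, top, _ in boxes]
    if not heights:
        return None
    limit = ratio * median(heights)
    big = [b for b, h in zip(boxes, heights) if h > limit]
    if not big:
        return None
    return (
        min(b[1] for b in big),
        min(b[2] for b in big),
        max(b[3] for b in big),
        max(b[4] for b in big),
    )
```

The returned rectangle is what you would crop out and pass to the OCR again on its own. Keep in mind that box-file coordinates are measured from the bottom-left corner of the image, so the y values have to be flipped before cropping with a library that uses a top-left origin.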

On Mon, Oct 22, 2012 at 10:12 AM, GeorgeS <[email protected]> wrote:

> With the current Tesseract engine I've noticed that if I perform a
> full-page OCR and there's an area with something printed in a slightly
> larger font (14?) than the surrounding text (10-12) then the larger text is
> not recognized. If I pull just that area out as an image and pass it to the
> OCR with the same settings it recognizes the larger text just fine. Is
> there some setting that could cause this behavior? There is no fiddling
> with Min/Max Char size going on.
>
> Thanks!
>
> --
> You received this message because you are subscribed to the Google
> Groups "tesseract-ocr" group.
> To post to this group, send email to [email protected]
> To unsubscribe from this group, send email to
> [email protected]
> For more options, visit this group at
> http://groups.google.com/group/tesseract-ocr?hl=en
>

