Tesseract's textline finder relies a bit too heavily on there being several
lines in a text block. The existing line finder is too unstable to make it
work well both ways.There are plans to fix this in 3.00, and there will also
be a new API to control the layout analysis mode to force it to believe that
all the text in the image is a single line, single word, or even single
character.
Ray.

On Sun, Nov 9, 2008 at 11:12 PM, Tien Dung <[EMAIL PROTECTED]> wrote:

> Hi Ray,
>
> One of the text line recognition scripts in OCRopus said that:
> http://ocropus.googlecode.com/svn/trunk/ocroscript/scripts/rec-ltess.lua
>
> -- Example of using the Tesseract line recognizer together with
> -- OCRopus layout analysis.
> -- Note: *this does not work very well right now because Tesseract*
> -- *has problems with recognizing text in individual lines*.
>
> Is it true that Tesseract has problems with recognizing text in individual
> lines?
>
> If yes, how can we fix it?
>
> Best regards,
>
> --
> Tien Dung
> http://codemonkeycode.blogspot.com/
>
> >
>

--~--~---------~--~----~------------~-------~--~----~
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To post to this group, send email to [email protected]
To unsubscribe from this group, send email to [EMAIL PROTECTED]
For more options, visit this group at 
http://groups.google.com/group/tesseract-ocr?hl=en
-~----------~----~----~----~------~----~------~--~---

Reply via email to