On 28 February 2011 15:17, Giuseppe Menga <[email protected]> wrote:
> At Politecnico di Torino we are using release 3.0.0 of tesseract with
> the standard English training.
> Obviously the software doesn’t recognize pages of text rotated upside down,
> and we would not expect it to. However, to our surprise, it recognizes, with
> slightly worse performance, text rotated 90° counterclockwise, but not
> clockwise.
> How is that possible?

It's a side effect of the support for Japanese, Chinese, etc., which can
be written vertically.

> We have to recognize text whose orientation we don’t know in advance, and I
> know that Leptonica should be used for page layout analysis.
> However, does tesseract offer internal facilities to recognize text
> orientation?
> And if so, how can we activate these facilities, or at least get tentative
> baselines returned?

There's an orientation/script detection module in the 3.01 code, but I
haven't even tried to use it, so I couldn't say.
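
For anyone who wants to experiment, here is a minimal sketch, not something
I've run against 3.01. It assumes a later Tesseract build whose
orientation/script detection prints a "Key: value" report, plus the
third-party pytesseract wrapper and its image_to_osd() call. The helper
names parse_osd() and detect_rotation() and the sample values are made up
for illustration.

```python
import re

# Illustrative report in the "Key: value" layout that later Tesseract
# releases emit for orientation/script detection. The numbers are
# invented for this example, not measured results.
SAMPLE_OSD = """Page number: 0
Orientation in degrees: 270
Rotate: 90
Orientation confidence: 5.49
Script: Latin
Script confidence: 1.23
"""


def parse_osd(text):
    """Turn 'Key: value' lines into a dict, converting numeric values."""
    result = {}
    for line in text.splitlines():
        key, sep, value = line.partition(":")
        if not sep:
            continue  # skip lines that are not 'Key: value'
        key, value = key.strip(), value.strip()
        if re.fullmatch(r"-?\d+", value):
            result[key] = int(value)
        elif re.fullmatch(r"-?\d+\.\d+", value):
            result[key] = float(value)
        else:
            result[key] = value
    return result


def detect_rotation(image_path):
    """Hypothetical helper: return the 'Rotate' field for an image.

    Assumes pytesseract, Pillow, and a Tesseract install with the OSD
    data file are available; none of that applies to stock 3.00.
    """
    import pytesseract
    from PIL import Image

    report = pytesseract.image_to_osd(Image.open(image_path))
    return parse_osd(report)["Rotate"]


if __name__ == "__main__":
    osd = parse_osd(SAMPLE_OSD)
    print(osd["Rotate"], osd["Script"])
```

As far as I understand it, the "Rotate" field is the correction to apply
to the page before running recognition, but check that against your own
images before relying on it.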

-- 
<Leftmost> jimregan, that's because deep inside you, you are evil.
<Leftmost> Also not-so-deep inside you.
