Dear Giuseppe,
Could you post some samples to analyze?
If you are afraid that tesseract page layout doesn't work on rotated image,
you can run step-by-step as belows:
1. Firstly, you can call tesseract to FindLinesCreateBlockList (have a look
at TessBaseAPI class), you should achieved a BLOCK_LIST.
2. Now, please check BLOCK_LIST:
I showed here only member fields:
...
ROW_LIST rows; //< rows in block
...
FCOORD skew_; //< Direction of true horizontal.
ICOORD median_size_; //< Median size of blobs.
And here are ROW class:
....
inT32 kerning; //inter char gap
inT32 spacing; //inter word gap
TBOX bound_box; //bounding box
float xheight; //height of line
float ascrise; //size of ascenders
float descdrop; //-size of descenders
WERD_LIST words; //words
QSPLINE baseline; //baseline spline
...
A page included block(s), and a block included row(s)....
3. Try to visualize any things you need to have an overview of
segmentation/detection step worked...
Also, if you want to understand how to tesseract works, please read some
papers in doc folder, they have been published by Ray.
Hope it's helpful to you!
Cong.
--
You received this message because you are subscribed to the Google Groups
"tesseract-ocr" group.
To post to this group, send email to [email protected].
To unsubscribe from this group, send email to
[email protected].
For more options, visit this group at
http://groups.google.com/group/tesseract-ocr?hl=en.