Hi everyone! I am trying to figure out what the two numbers after "baseline" in the hOCR ocr_line tags mean, but so far I haven't been able to work it out. Here is one of these tags as an example:

<span xmlns="http://www.w3.org/1999/xhtml" class="ocr_line" id="line_1_48" title="bbox 879 1300 1240 1335; baseline 0 -6">

So far, I've noticed a few things:

Second number: it is always an integer. If the line is skewed upward (anticlockwise) or not skewed at all, it is zero when no characters extend below the baseline, and negative when some do (descenders such as p, q, y). If the line is skewed downward (clockwise), it is negative.

First number: it is a decimal value. It is zero if there is no skew, negative if the line is skewed anticlockwise, and positive if it is skewed clockwise. I thought it could be an angle expressed in some way, but I couldn't work out how.

The documentation says:

"baseline pn pn-1 … p0 - a polynomial describing the baseline of a line of text, the polynomial is in the coordinate system of the line, with the bottom left of the bounding box as the origin"

But this is not clear to me. Can anybody help me here? I would really appreciate it!
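
One tentative reading of the documentation sentence quoted above, written out as a short Python sketch to make the question concrete. It assumes that the numbers are polynomial coefficients listed from the highest power down to the constant term (so for two numbers: slope, then offset), that x is measured from the left edge of the bbox, and that the result is added to the bottom edge of the bbox in image coordinates where y grows downward. These assumptions match the signs observed above but are not confirmed by the documentation. The function names are made up for illustration and are not part of any hOCR library.

```python
import math


def parse_line_title(title):
    """Pull the bbox and baseline numbers out of an hOCR title string."""
    fields = {}
    for part in title.split(";"):
        tokens = part.split()
        if tokens and tokens[0] in ("bbox", "baseline"):
            fields[tokens[0]] = [float(t) for t in tokens[1:]]
    return fields["bbox"], fields["baseline"]


def baseline_y(bbox, coeffs, x):
    """Page y of the baseline at page x (assumption: y grows downward)."""
    x0, y0, x1, y1 = bbox
    dx = x - x0  # x relative to the bottom-left corner of the bbox
    offset = 0.0
    for c in coeffs:  # Horner's rule, coefficients ordered pn ... p0
        offset = offset * dx + c
    return y1 + offset  # assumption: offset is relative to the bbox bottom


def skew_degrees(coeffs):
    """Angle implied by the linear coefficient, if it is read as a slope."""
    slope = coeffs[-2] if len(coeffs) >= 2 else 0.0
    return math.degrees(math.atan(slope))


# The example tag from the start of this post.
bbox, coeffs = parse_line_title("bbox 879 1300 1240 1335; baseline 0 -6")
print(baseline_y(bbox, coeffs, bbox[0]))  # 1329.0 under the assumptions above
print(skew_degrees(coeffs))               # 0.0, i.e. no skew
```

Under this reading, "baseline 0 -6" would describe a horizontal baseline 6 pixels above the bottom of the bounding box, which would fit a line that is not skewed and contains descenders.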

