Hello list! i have been asking a few questions here about a year ago, and have been following the list quietly since then. i'm using tesseract in one of my project (i believe it's not the HEAD version but a 3.0 ish version).
i am running recognition on a set of documents which have a small corner at the
end of the text line i'm trying to read... (See picture attached, bottom right
corner of the text presents this small corner).
This small corner line induces very often an error with the last character, and
i would like to be able to know the character spacing of each recognized
characters to eliminate potential errors when the spacing is not "proportional"
to the mean of all other spacings... How can i retrieve the spacing of a
recognized text within the C++ wrapper API?
The rest of the recognition is going pretty well... But i can't rely of picture
pre-processing since sometimes this corner is as dark as the rest of the
picture! Also, the ending character ('<' in the example) can be different as
well (Can be a digit or anything else.
Thanks a lot for your time and help!
Pierre.
--
You received this message because you are subscribed to the Google
Groups "tesseract-ocr" group.
To post to this group, send email to [email protected]
To unsubscribe from this group, send email to
[email protected]
For more options, visit this group at
http://groups.google.com/group/tesseract-ocr?hl=en
<<inline: CornerExample.jpg>>

