Character spacing question.

MARTIN Pierre Thu, 07 Jul 2011 08:47:56 -0700

Hello list!

I asked a few questions here about a year ago, and have been
following the list quietly since then.
I'm using Tesseract in one of my projects (I believe it's not the HEAD version
but a 3.0-ish version).


I am running recognition on a set of documents that have a small corner at the
end of the text line I'm trying to read (see the attached picture: the small
corner is at the bottom right of the text).

This small corner very often causes an error in the last character. I would
like to know the spacing between the recognized characters, so that I can
eliminate potential errors when a spacing is not "proportional" to the mean of
all the other spacings. How can I retrieve the spacing of recognized text
through the C++ wrapper API?
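
To illustrate the check described above, here is a minimal sketch. It assumes
the horizontal extent of each recognized character is already available; it
does not call Tesseract at all. The CharBox struct, the function names and the
tolerance value are hypothetical and not part of the Tesseract API.

```cpp
#include <algorithm>
#include <cstddef>
#include <vector>

// Hypothetical per-character box in pixels (not a Tesseract type).
struct CharBox {
    char ch;    // recognized character
    int left;   // x coordinate of the left edge
    int right;  // x coordinate of the right edge
};

// Horizontal gap between each pair of consecutive characters on one line.
std::vector<int> ComputeGaps(const std::vector<CharBox>& boxes) {
    std::vector<int> gaps;
    for (std::size_t i = 1; i < boxes.size(); ++i) {
        gaps.push_back(boxes[i].left - boxes[i - 1].right);
    }
    return gaps;
}

// Returns true when the gap before the last character differs from the mean
// of all the other gaps by more than `tolerance` times that mean.
// `tolerance` is an assumed, tunable fraction (e.g. 0.5).
bool LastGapIsSuspicious(const std::vector<CharBox>& boxes, double tolerance) {
    const std::vector<int> gaps = ComputeGaps(boxes);
    if (gaps.size() < 3) {
        return false;  // too few characters to estimate a typical gap
    }
    double sum = 0.0;
    for (std::size_t i = 0; i + 1 < gaps.size(); ++i) {
        sum += gaps[i];
    }
    const double mean = sum / static_cast<double>(gaps.size() - 1);
    const double last = gaps.back();
    const double diff = last > mean ? last - mean : mean - last;
    // Guard against a zero or negative mean (touching or overlapping boxes).
    return diff > tolerance * std::max(mean, 1.0);
}
```

With something like this, a line whose final gap is flagged could have its
last character dropped or re-checked before the result is used.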

The rest of the recognition is going pretty well, but I can't rely on picture
pre-processing, since sometimes this corner is as dark as the rest of the
picture! Also, the ending character ('<' in the example) can vary as well (it
can be a digit or anything else).

Thanks a lot for your time and help!
Pierre.


<<inline: CornerExample.jpg>>
