Letters from different rows are connected

Alcareru Wed, 12 Aug 2009 03:12:21 -0700

Hi

If the rows on an image are close to each other and some letters are
connected (like for example j connects to an l on a row below it.)
tesseract fails to process the image right. At best it ignores the
letter below totally or pulls it on the row above. So if have text
like (l conects to g):
Aug
 Helsinki
it is read as:
Aulg
 He sinki
or
Aug
 He sinki


Is there anything one can do to avoid that, (I'm not too keen on
trying to implement an algorithm that tries to figure out where the
rows go and space them out a bit, which is the only thing I can come
up with.)? Are future releases of tesseract possibly addressing this
issue?
--~--~---------~--~----~------------~-------~--~----~
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To post to this group, send email to [email protected]
To unsubscribe from this group, send email to 
[email protected]
For more options, visit this group at 
http://groups.google.com/group/tesseract-ocr?hl=en
-~----------~----~----~----~------~----~------~--~---

Letters from different rows are connected

Reply via email to