I received a GREAT answer from svaram for my question about Searching for older threads: google for "touch letters site:groups.google.com/ group/tesseract-ocr"
I did not find the thread I remembered, but the following one looked applicable: http://groups.google.com/group/tesseract-ocr/browse_thread/thread/4506eff0ef665c2b On Oct 7, 10:01 am, SteveP <[email protected]> wrote: > I remember something about "letters that touch" from this forum over a > year ago. I don't remember any details. > > I have no success doing a Search in this forum for things that old. > Does anybody know how to Search for older things? > > On Oct 4, 11:24 pm, Alcareru <[email protected]> wrote: > > > > > Bump... > > > On 12 elo, 13:12, Alcareru <[email protected]> wrote: > > > > Hi > > > > If the rows on an image are close to each other and some letters are > > > connected (like for example j connects to an l on a row below it.) > > > tesseract fails to process the image right. At best it ignores the > > > letter below totally or pulls it on the row above. So if have text > > > like (l conects to g): > > > Aug > > > Helsinki > > > it is read as: > > > Aulg > > > He sinki > > > or > > > Aug > > > He sinki > > > > Is there anything one can do to avoid that, (I'm not too keen on > > > trying to implement an algorithm that tries to figure out where the > > > rows go and space them out a bit, which is the only thing I can come > > > up with.)? Are future releases of tesseract possibly addressing this > > > issue?- Hide quoted text - > > > - Show quoted text -- Hide quoted text - > > - Show quoted text - --~--~---------~--~----~------------~-------~--~----~ You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To post to this group, send email to [email protected] To unsubscribe from this group, send email to [email protected] For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en -~----------~----~----~----~------~----~------~--~---

