I received a GREAT answer from svaram for my question about Searching
for older threads:  google for "touch letters site:groups.google.com/
group/tesseract-ocr"

I did not find the thread I remembered, but the following one looked
applicable:

http://groups.google.com/group/tesseract-ocr/browse_thread/thread/4506eff0ef665c2b

On Oct 7, 10:01 am, SteveP <[email protected]> wrote:
> I remember something about "letters that touch" from this forum over a
> year ago.  I don't remember any details.
>
>  I have no success doing a Search in this forum for things that old.
> Does anybody know how to Search for older things?
>
> On Oct 4, 11:24 pm, Alcareru <[email protected]> wrote:
>
>
>
> > Bump...
>
> > On 12 elo, 13:12, Alcareru <[email protected]> wrote:
>
> > > Hi
>
> > > If the rows on an image are close to each other and some letters are
> > > connected (like for example j connects to an l on a row below it.)
> > > tesseract fails to process the image right. At best it ignores the
> > > letter below totally or pulls it on the row above. So if have text
> > > like (l conects to g):
> > > Aug
> > >  Helsinki
> > > it is read as:
> > > Aulg
> > >  He sinki
> > > or
> > > Aug
> > >  He sinki
>
> > > Is there anything one can do to avoid that, (I'm not too keen on
> > > trying to implement an algorithm that tries to figure out where the
> > > rows go and space them out a bit, which is the only thing I can come
> > > up with.)? Are future releases of tesseract possibly addressing this
> > > issue?- Hide quoted text -
>
> > - Show quoted text -- Hide quoted text -
>
> - Show quoted text -
--~--~---------~--~----~------------~-------~--~----~
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To post to this group, send email to [email protected]
To unsubscribe from this group, send email to 
[email protected]
For more options, visit this group at 
http://groups.google.com/group/tesseract-ocr?hl=en
-~----------~----~----~----~------~----~------~--~---

Reply via email to