regarding API - have a look at GetHOCRText[1] and AddBoxTohOCR[2] functions as example for getting bounding box information.
[1] http://code.google.com/p/tesseract-ocr/source/browse/trunk/api/baseapi.cpp?r=760#1055 [2] http://code.google.com/p/tesseract-ocr/source/browse/trunk/api/baseapi.cpp?r=760#1035 -- Zdenko On Wed, Oct 3, 2012 at 4:02 PM, Sven Pedersen <[email protected]>wrote: > The hOCR output option (embedded in HTML) has that information, but > I'm not aware of any other option. The API might allow access to that > info. > --Sven > > On Wed, Oct 3, 2012 at 7:35 AM, Attila Somogyi <[email protected]> > wrote: > > Hi! > > > > How can i get the x,y location of the characters/words recognized by > > tesseract? > > > > -- > > You received this message because you are subscribed to the Google > > Groups "tesseract-ocr" group. > > To post to this group, send email to [email protected] > > To unsubscribe from this group, send email to > > [email protected] > > For more options, visit this group at > > http://groups.google.com/group/tesseract-ocr?hl=en > > > > -- > ``All that is gold does not glitter, > not all those who wander are lost; > the old that is strong does not wither, > deep roots are not reached by the frost. > From the ashes a fire shall be woken, > a light from the shadows shall spring; > renewed shall be blade that was broken, > the crownless again shall be king.” > > -- > You received this message because you are subscribed to the Google > Groups "tesseract-ocr" group. > To post to this group, send email to [email protected] > To unsubscribe from this group, send email to > [email protected] > For more options, visit this group at > http://groups.google.com/group/tesseract-ocr?hl=en > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To post to this group, send email to [email protected] To unsubscribe from this group, send email to [email protected] For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en

