I searched a lot and found this: tesseract image.tif boxes batch.nochop makebox
If I invoke that, i get a boxes.txt file with what appear to be coordinates. But they are too large. I read somewhere that tesseract computes the coordinates from the bottom of the image and not from the top left corner (from the tests I did, this does not appear to be valid). There are also two instances of the same word (same combination of letters) appearing in boxes.txt, whereas the image contains only one instance. Can anybody please shed some light here? On Mon, May 24, 2010 at 4:59 AM, gadv <[email protected]> wrote: > I'm using Linux and Mac OS X and I want to use tesseract to identify a > specific word from a desktop screenshot. I want to apply an > imagemagick filter at that specific area. How would I get the exact > dimensions of the rectangle/area of that keyword? > > -- > You received this message because you are subscribed to the Google Groups > "tesseract-ocr" group. > To post to this group, send email to [email protected]. > To unsubscribe from this group, send email to > [email protected]. > For more options, visit this group at > http://groups.google.com/group/tesseract-ocr?hl=en. > > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To post to this group, send email to [email protected]. To unsubscribe from this group, send email to [email protected]. For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en.

