With the tesseract executable, you can use the tessedit_create_boxfile 
config variable. For example:

tesseract myimage.png out makebox

where "makebox" is a configuration file that contains the line:

tessedit_create_boxfile 1

In my case, I tested it with Tesseract v3.02.02 on Linux and my config 
file, that had only that one line, was on the same directory I invoked the 
program. However, you can also use the configuration files in the tessdata 
directory that come with the source code.

Tesseract will create the text file "out.box" consisting of lines with six 
columns, like this:
a 24 55 32 70 0
b 41 57 51 71 0
c 59 58 69 73 0

Column 1: the character
Column 2: bounding box lower left X coordinate
Column 3: bounding box lower left Y coordinate
Column 4: bounding box upper right X coordinate
Column 4: bounding box upper right Y coordinate
Column 6: ?

Note that the origin of the coordinate system is on the lower left corner 
of the image, with the Y-axis "pointing upwards".

Julian.

-- 
You received this message because you are subscribed to the Google
Groups "tesseract-ocr" group.
To post to this group, send email to [email protected]
To unsubscribe from this group, send email to
[email protected]
For more options, visit this group at
http://groups.google.com/group/tesseract-ocr?hl=en

Reply via email to