I've attached two files.

The first file is my original image. Tesseract returns an empty page for it
(using eng.traineddata).

I noticed that the image had no margin at the top and very little at the
bottom, so I used GIMP to add about 4 pixels at the top and bottom. The result
is the second attached file.

The padded image OCRed properly.
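
In case anyone wants to script the padding instead of doing it by hand in
GIMP, here is a minimal sketch of the same step. It assumes Pillow is
installed and that the image is bilevel, grayscale, or RGB. The function name
pad_top_bottom and the default of 4 pixels are just my illustration, not
anything Tesseract itself provides or requires.

```python
from PIL import Image, ImageOps


def pad_top_bottom(img, pad_px=4, fill="white"):
    """Return a copy of img with pad_px rows of fill added above and below.

    Assumption: img is a Pillow image in mode "1", "L", or "RGB".
    """
    # border is (left, top, right, bottom), so only the height grows.
    return ImageOps.expand(img, border=(0, pad_px, 0, pad_px), fill=fill)
```

Typical use would be something like
pad_top_bottom(Image.open("input.tif")).save("padded.tif"), where both file
names are placeholders.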

Command line:
tesseract -c tessedit_create_tsv=1 tess_1_1b.tif tess

Output:
level   page_num    block_num   par_num line_num    word_num    left    top width   height  conf    text
1   1   0   0   0   0   0   0   336 110 -1<>
2   1   1   0   0   0   28  7   270 98  -1<>
3   1   1   1   0   0   28  7   270 98  -1<>
4   1   1   1   1   0   28  7   270 98  -1<>
5   1   1   1   1   1   28  7   270 98  91  A1.01


So the recognized text is A1.01, with a confidence of 91.
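
For reading the TSV programmatically, here is a minimal sketch using only the
Python standard library. It relies on the column names shown in the header
above and on level 5 marking word rows. The function name words_with_conf is
hypothetical, and I am assuming the real file is tab-separated.

```python
import csv
import io


def words_with_conf(tsv_text):
    """Return (text, conf) pairs for the word-level rows of Tesseract TSV."""
    reader = csv.DictReader(
        io.StringIO(tsv_text), delimiter="\t", quoting=csv.QUOTE_NONE
    )
    words = []
    for row in reader:
        # Levels 1 to 4 are page, block, paragraph, and line rows (conf -1).
        if row["level"] == "5":
            # A missing trailing field is read as None, so fall back to "".
            words.append((row["text"] or "", float(row["conf"])))
    return words
```

An empty list from this function corresponds to the "empty page" case I get
with the unpadded image.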

Should I file a bug? Or always pad my images with whitespace?

Thanks
