Hi

I have my new language "lan" set up. When I run

tesseract p.tiff -l lan

I get poor results, so I decided to add p.tiff to the training set. 
My training images are in one big multi-page tiff, so I appended p.tiff as 
the last page. I then created the box file for the big tiff:

tesseract lan.normal.exo0.tiff lan.normal.exo0 -l lan batch.nochop makebox

I opened the auto-generated box file, and its results for the p.tiff page 
are actually better than what I get by running the OCR on p.tiff directly.
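
To compare the two outputs on equal footing, the characters for the p.tiff 
page can be pulled out of the box file. This is only a minimal sketch: it 
assumes each box line has the layout "symbol left bottom right top page" 
with page numbers starting at 0, and box_page_chars is a hypothetical 
helper name, not part of tesseract.

```shell
#!/bin/sh
# box_page_chars FILE PAGE
# Print the symbols recorded for one page of a box file as a single line.
# Assumption: each box line is "symbol left bottom right top page" and
# page numbers start at 0. Lines whose symbol is itself a space are not
# handled by this sketch.
box_page_chars() {
    awk -v page="$2" '$6 == page { printf "%s", $1 } END { print "" }' "$1"
}

# Example call (the page index 12 is a placeholder for the last page):
# box_page_chars lan.normal.exo0.box 12
```

The result contains no spaces or line breaks, so the plain OCR text needs 
its whitespace removed (for example with tr -d '[:space:]') before the two 
are compared.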

As I understand it, both commands use the same traineddata file. Is there 
some flag I am missing from the first command? Does this make sense? I am 
analyzing handwritten text, if that matters.

Yaron
