Hi,

I have my new language "lan" set up. When I call

    tesseract p.tiff -l lan

I get poor results, so I decided to add p.tiff to the training set. Basically, I have one big TIFF, so I added p.tiff as its last page. Then I created the box file for the big TIFF:

    tesseract lan.normal.exo0.tiff lan.normal.exo0 -l lan batch.nochop makebox

When I open the auto-generated box file, it actually has better results for p.tiff than running the OCR directly.

To my understanding, both methods use the same traineddata file. Is there some flag I am missing from the first command? Does this make sense?

I am analyzing handwritten text, if that matters.

Yaron

