On Tue, May 21, 2013 at 10:47:52AM -0700, [email protected] wrote: > Would anyone be able to point me to the tesseract-ocr 3.02 english training > data (the box/tif/.tr files associated with the standard eng.traineddata) as > well as the corresponding font_properties file? I didn't find it in the files > provided in tesseract-ocr-3.002.eng.tar.gz or in downloads. I did find the > files for tesseract 2.XX, but not 3.02. I have tried googling for the file > names but have yet to come up with anything.
Unfortunately the box/tif files for the 3.02 english training data are not publically available (beyond what you can find and use from combine_tessdata -u). For now the only thing you can do is create a separate training and use both as different languages (e.g. -l eng+myeng). It certainly isn't ideal, but beyond the people releasing the training data there isn't much that can be done about it. Nick -- -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To post to this group, send email to [email protected] To unsubscribe from this group, send email to [email protected] For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en --- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. For more options, visit https://groups.google.com/groups/opt_out.

