Hello Everyone,

Where can I find the box/tif combo for the eng.traineddata that Tessearct 
3.02 provides for download?

Since I cannot append new font information to compiled training data I have 
to completely create new data even for the fonts that Tesseract's prebuilt 
training provides.  When I do this I lose accuracy over the provided 
training data significantly (accuracy ends up being around 50% down from 
above 90%).   If I had Tessearct's source box/tif files then adding my 
fonts should at worst still have nearly the same accuracy as the project 
provided files for documents that contain the default fonts.  It would seem 
that Tesseract providing the box/tif originally used would be the easiest 
solution in keeping user's accuracy up while attempting new fonts.

I have found box/tif files for tessearct 2.0 but not 3.0.  When I use the 
box/tif files from 2.0 for the fonts provided like Arial, Courier New, etc 
I significantly lose accuracy.

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To post to this group, send email to [email protected].
Visit this group at http://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/88f08cfe-ef4a-4b20-8b95-1bdb29b702b4%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Reply via email to