Regarding question 2, I just found 2 sites to explain the control parameters:
https://code.google.com/p/tesseract-ocr/wiki/ControlParams http://www.sk-spell.sk.cx/tesseract-ocr-parameters-in-302-version 在 2015年5月11日星期一 UTC+8下午8:49:04,smwikipedia smwikipedia写道: > > > > 1. For tesseract 3.02, after installation I see there's a pre-trained > *eng.traineddata* file in the tessdata folder. How is this file > generated? What font does it target? Can I blindly use it for my OCR > application? > > 2. For tesseract 3.03, I see there's a new option "--print-parameters" for > the tesseract executable. There're more than 600 parameters. How am I > supposed to use them? If I need to tune them, how? > > 3. During my experimentation, I see tesseract works better for some font > type than other font type. Is this true? Which font has the best precision? > > > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To post to this group, send email to [email protected]. Visit this group at http://groups.google.com/group/tesseract-ocr. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/e201c2a8-3271-40f6-87a0-183245a19abb%40googlegroups.com. For more options, visit https://groups.google.com/d/optout.

