It's fairly rare that an application needs to train Tesseract for English fonts because Tesseract already has so many it its training set, unless you are talking about very special fonts which you say they are not. Care to share the images you are scanning?
Patrick On Mon, Aug 26, 2013 at 6:08 PM, Jon Jacobs <[email protected]> wrote: > I read that one trains tesseract for new language character sets. > I have been getting very poor accruacy (around 20%) for plain-old English > images with ordinary fonts (I thought). > Should I perhaps train it on samples of these images? > > Thanks, > > -- > -- > You received this message because you are subscribed to the Google > Groups "tesseract-ocr" group. > To post to this group, send email to [email protected] > To unsubscribe from this group, send email to > [email protected] > For more options, visit this group at > http://groups.google.com/group/tesseract-ocr?hl=en > > --- > You received this message because you are subscribed to the Google Groups > "tesseract-ocr" group. > To unsubscribe from this group and stop receiving emails from it, send an > email to [email protected]. > For more options, visit https://groups.google.com/groups/opt_out. > -- Patrick Questembert, *ScanBizCards* +1-917-250-4177 | www.scanbizcards.com twitter.com/ScanBizCards | www.facebook.com/ScanBizCards Just released: Power Contacts - http://itunes.apple.com/us/app/power-contacts/id476986356?mt=8 -- -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To post to this group, send email to [email protected] To unsubscribe from this group, send email to [email protected] For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en --- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. For more options, visit https://groups.google.com/groups/opt_out.

