I don“t want to train real fonts! I have many invoices and so on in different scan qualities! theese files should be processed with the fulltext in a database!
I have tried theese files with a bad result! Now I want to train a large number of files, to get a better result! Every time I am not lucky with the result, this page I want to train tesseract. that means I want to extend a traindata file everytime! Am Montag, 9. Dezember 2013 15:50:00 UTC+1 schrieb Nick White: > > On Mon, Dec 09, 2013 at 06:34:03AM -0800, Ingo W. wrote: > > That means, at the end I have hundrets of filenames I should use when I > trying > > to train serveral pages? > > I'm not sure I understand the question. > > If you need to train several different fonts, you should do that all > as part of your new training file. > > But note that retraining tesseract for an existing language probably > isn't worthwhile unless the fonts you're seeking to recognise are > quite different. > > Does that clarify things for you? > > Nick > -- -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To post to this group, send email to [email protected] To unsubscribe from this group, send email to [email protected] For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en --- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. For more options, visit https://groups.google.com/groups/opt_out.

