I don't want to train real fonts!
I have many invoices and similar documents in varying scan quality!
These files should be processed and their full text stored in a database!

I have tried these files, with bad results!

Now I want to train on a large number of files to get a better result!
Every time I am not happy with the result for a page, I want to train
tesseract on that page.

That means I want to extend a traineddata file every time!

On Monday, 9 December 2013 15:50:00 UTC+1, Nick White wrote:
>
> On Mon, Dec 09, 2013 at 06:34:03AM -0800, Ingo W. wrote: 
> > That means that, in the end, I have hundreds of filenames I should use when
> > I am trying to train on several pages?
>
> I'm not sure I understand the question. 
>
> If you need to train several different fonts, you should do that all 
> as part of your new training file. 
>
> But note that retraining tesseract for an existing language probably 
> isn't worthwhile unless the fonts you're seeking to recognise are 
> quite different. 
>
> Does that clarify things for you? 
>
> Nick 
>

-- 
-- 
You received this message because you are subscribed to the Google
Groups "tesseract-ocr" group.
To post to this group, send email to [email protected]
To unsubscribe from this group, send email to
[email protected]
For more options, visit this group at
http://groups.google.com/group/tesseract-ocr?hl=en

