You can create a new traineddata file with multiple 'fake-fonts' and then
use in addition to the existing traineddata.

eg. -l deu+newtraineddata

so you don't have to have separate traineddata for each font,

though you'll have separate .tr files - one for each of your 'fake-fonts'

Shree Devi Kumar
____________________________________________________________
भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com


On Mon, Dec 9, 2013 at 8:28 PM, Ingo W. <[email protected]> wrote:

> I don´t want to train real fonts!
> I have many invoices and so on in different scan qualities!
> theese files should be processed with the fulltext in a database!
>
> I have tried theese files with a bad result!
>
> Now I want to train a large number of files, to get a better result!
> Every time I am not lucky with the result, this page I want to train
> tesseract.
>
> that means I want to extend a traindata file everytime!
>
> Am Montag, 9. Dezember 2013 15:50:00 UTC+1 schrieb Nick White:
>
>> On Mon, Dec 09, 2013 at 06:34:03AM -0800, Ingo W. wrote:
>> > That means, at the end I have hundrets of filenames I should use when I
>> trying
>> > to train serveral pages?
>>
>> I'm not sure I understand the question.
>>
>> If you need to train several different fonts, you should do that all
>> as part of your new training file.
>>
>> But note that retraining tesseract for an existing language probably
>> isn't worthwhile unless the fonts you're seeking to recognise are
>> quite different.
>>
>> Does that clarify things for you?
>>
>> Nick
>>
>  --
> --
> You received this message because you are subscribed to the Google
> Groups "tesseract-ocr" group.
> To post to this group, send email to [email protected]
> To unsubscribe from this group, send email to
> [email protected]
> For more options, visit this group at
> http://groups.google.com/group/tesseract-ocr?hl=en
>
> ---
> You received this message because you are subscribed to the Google Groups
> "tesseract-ocr" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to [email protected].
> For more options, visit https://groups.google.com/groups/opt_out.
>

-- 
-- 
You received this message because you are subscribed to the Google
Groups "tesseract-ocr" group.
To post to this group, send email to [email protected]
To unsubscribe from this group, send email to
[email protected]
For more options, visit this group at
http://groups.google.com/group/tesseract-ocr?hl=en

--- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
For more options, visit https://groups.google.com/groups/opt_out.

Reply via email to