Language Files

georg Mon, 26 Aug 2013 04:44:15 -0700

Hello,

I have a question regarding language files.


We have a set of characters, which sometimes has cut off characters.

It is my understanding that I can not train very different looking 
characters in one set, because it causes tesseract to get confused.

I would like to generate 2 tiffs (one for complete characters and one for 
cut off ones) and then do the mft training.

Is it true that mft training assembles both tiffs in one language file and 
runs tesseract twice, first with the tiff for the whole characters and once 
for the cut off characters?

Does tesseract keep the tiffs separate although they are in the same 
language file?

How would you work this problem? - I want to try and keep the training 
process as simple as possible (it is already complicated enough).

Thanks for your help!

Take care,

Georg


-- 
-- 
You received this message because you are subscribed to the Google
Groups "tesseract-ocr" group.
To post to this group, send email to [email protected]
To unsubscribe from this group, send email to
[email protected]
For more options, visit this group at
http://groups.google.com/group/tesseract-ocr?hl=en

--- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
For more options, visit https://groups.google.com/groups/opt_out.

Language Files

Reply via email to