Is it hard to add a new font to existing .traineddata?

haoest Tue, 06 Sep 2011 23:41:55 -0700

I read the instructions (http://code.google.com/p/tesseract-ocr/wiki/
TrainingTesseract3) several times over before I attempted, but am
still uncertain.


I am trying to add a new font, OCR-A, to the existing eng.traineddata
file. All I need is the digits from 0 to 9, so I made a tif file
consist of those 10 characters, made a box file and .tr file out of
it, and this is where I hit the road block.

I don't think I can simply append the output of cntraining or
mftraining into the existing eng.inttemp/normproto. I need to rebuild
ALL the .tr files from the original English tif/box package and then
feed all of them, including my own .tr file, into the training
prorgram to re-produce the inttmp and proto files.

Is this correct, and is there an easier way? I just want 10 characters
in OCR-A (http://en.wikipedia.org/wiki/OCR-A_font)

-- 
You received this message because you are subscribed to the Google
Groups "tesseract-ocr" group.
To post to this group, send email to [email protected]
To unsubscribe from this group, send email to
[email protected]
For more options, visit this group at
http://groups.google.com/group/tesseract-ocr?hl=en

Is it hard to add a new font to existing .traineddata?

Reply via email to