Re: user-words

Calomer Thu, 29 Sep 2011 11:26:11 -0700

I'll try my best to answer, tho I'm hardly eligible.

According to training instructions (on 
http://code.google.com/p/tesseract-ocr/wiki/TrainingTesseract3)
and general OCR knowledge, you cannot train solely by new characters.
You need training images, you need to create boxes (with any box
editor, but I only used Qt Box Editor). Once you create new boxes
around your new tiff image, and label them accordingly, you should be
ready for training.


Keep in mind, you'll need at least 12 low x-height in pixels
(preferably around 20 pixels), variety in images would be nice for
increased performance.

Follow training instructions, train your own language file, try OCR
again, if you fail again, I'm sure someone else who has wider
knowledge than me should be able to answer your further questions.

On Sep 29, 2:44 pm, Bonny <[email protected]> wrote:
> Nobody know or the question is too silly?

-- 
You received this message because you are subscribed to the Google
Groups "tesseract-ocr" group.
To post to this group, send email to [email protected]
To unsubscribe from this group, send email to
[email protected]
For more options, visit this group at
http://groups.google.com/group/tesseract-ocr?hl=en

Re: user-words

Reply via email to