Train Tesseract with sample word images

Raj Julha Wed, 06 Jul 2011 08:44:58 -0700

Hi

I'm planning to train Tesseract on handwritten text, mainly from
historical documents. Because the handwriting is cursive, it is
difficult to isolate single characters, so I was planning to create
images of whole words and use a list of those words as the training
source. Alternatively, I could create a text file containing the
transcription of the handwritten text and the coordinates of each word
in the image. Can I use that as input for Tesseract training? I'm
mainly interested in using the command-line version.
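
To make the second option concrete, here is a minimal sketch of what
such a word-level annotation file might look like. Everything in it is
an assumption for illustration: the WordBox type, the
format_word_boxes helper, the sample words and the coordinates are all
hypothetical. The line layout (text, left, bottom, right, top, page) is
loosely modeled on Tesseract's character-level box files; whether the
training tools accept word-level entries like these is exactly the
open question.

```python
from typing import Iterable, NamedTuple


class WordBox(NamedTuple):
    """One transcribed word and its bounding box on the page image.

    Hypothetical structure; coordinates are in pixels.
    """
    text: str
    left: int
    bottom: int
    right: int
    top: int
    page: int = 0


def format_word_boxes(boxes: Iterable[WordBox]) -> str:
    """Render one 'text left bottom right top page' line per word."""
    lines = []
    for box in boxes:
        # Fields are space-separated, so a word may not contain whitespace.
        if not box.text or any(ch.isspace() for ch in box.text):
            raise ValueError(f"invalid word text: {box.text!r}")
        # Reject empty or inverted rectangles.
        if box.left >= box.right or box.bottom >= box.top:
            raise ValueError(f"invalid box for {box.text!r}")
        lines.append(
            f"{box.text} {box.left} {box.bottom} {box.right} {box.top} {box.page}"
        )
    return "\n".join(lines) + "\n"


# Made-up sample data, not taken from any real document.
sample = [
    WordBox("Dear", 120, 840, 260, 900),
    WordBox("Sir", 280, 842, 360, 898),
]

print(format_word_boxes(sample), end="")
```

Writing the returned string to a file would give one annotation file
per page image.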


Cheers

Raj

-- 
You received this message because you are subscribed to the Google
Groups "tesseract-ocr" group.
