Hi all!

I'm working on a project to digitize judicial case files. We want
to use tesseract, but we haven't had great results so far.
I think that if I train tesseract specifically for the font used in
these files, we could get more correct results, but I haven't been
able to train my character set.
I have installed tesseract 3.01 on Ubuntu 11.04 and followed the
instructions posted at
http://code.google.com/p/tesseract-ocr/wiki/TrainingTesseract3.
At the step
http://code.google.com/p/tesseract-ocr/wiki/TrainingTesseract3#Run_Tesseract_for_Training
I get many FATALITY errors and I don't know how to fix them.

I also tried the character set images used for the spa training, but I
got errors with those as well.

Can somebody give me a simple step-by-step example of training tesseract
for a specific character set?

Thanks in advance,
Esteban.
