Hi all! I'm working on a project that wants to digitize judicial expedients. We want to use tesseract but we haven't had great results. I think that if I train tesseract very specifically for the kind of font that the expedients uses we could increase the positive results but I couldn't trained my character set. I have installed tesseract 3.01 in Ubuntu 11.04 and I followed the instructions posted on http://code.google.com/p/tesseract-ocr/wiki/TrainingTesseract3. In the step http://code.google.com/p/tesseract-ocr/wiki/TrainingTesseract3#Run_Tesseract_for_TrainingI've got many FATALITIES and I don't know how can I fix it.
I tried with character set images used in spa training but I also had errors. Somebody can give me a simple example step by step to train tesseract for specific charset? Thanks in advance, Esteban. -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To post to this group, send email to [email protected] To unsubscribe from this group, send email to [email protected] For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en

