If you got error on font_properties file, send also font_properties ;-) Zdenko
On Tue, Jun 21, 2011 at 2:45 PM, Esteban Bordón <[email protected]> wrote: > For example using these files provides in > http://tesseract-ocr.googlecode.com/files/boxtiff-2.01.spa.tar.gz and the > command lines bellow > > *]$ tesseract spa.cour.g4.tif spa.cour.g4 nobatch box.train > ]$ unicharset_extractor spa.cour.g4.box* > > These commands work ok but I don't know how I must continue > If I run: > *]$ mftraining -F font_properties -U unicharset spa.cour.g4.tr* > I get > *Reading spa.cour.g4.tr ... > spa.cour.g4 has no defined properties. > > Error: Illegal short name for a feature! > > Fatal error: No error trap defined! > Signal_termination_handler called with signal 2000* > > Now I'm trying in tesseract 3.00, then I can't use font_properties: > *[ebordon@ebordon ]$ mftraining -U unicharset spa.cour.g4.tr Reading > spa.cour.g4.tr ... > spa.cour.g4 has no defined properties. > > Error: Illegal short name for a feature! > > Fatal error: No error trap defined! > Signal_termination_handler called with signal 2000 > * > and: > > *[ebordon@ebordon ]$ cntraining spa.cour.g4.tr > Reading spa.cour.g4.tr ... > > Error: Illegal short name for a feature! > > Fatal error: No error trap defined! > Signal_termination_handler called with signal 2000 > * > Thanks, > Esteban. > > > > 2011/6/20 Dmitri Silaev <[email protected]> > >> You have to show us your training images, resulted box files and all >> used command lines. >> >> Warm regards, >> Dmitri Silaev >> www.CustomOCR.com >> >> >> >> >> >> On Mon, Jun 20, 2011 at 8:04 PM, Esteban Bordón <[email protected]> >> wrote: >> > Hi all! >> > >> > I'm working on a project that wants to digitize judicial expedients. We >> want >> > to use tesseract but we haven't had great results. >> > I think that if I train tesseract very specifically for the kind of font >> > that the expedients uses we could increase the positive results but I >> > couldn't trained my character set. >> > I have installed tesseract 3.01 in Ubuntu 11.04 and I followed the >> > instructions posted on >> > http://code.google.com/p/tesseract-ocr/wiki/TrainingTesseract3. >> > In the step >> > >> http://code.google.com/p/tesseract-ocr/wiki/TrainingTesseract3#Run_Tesseract_for_Training >> > I've got many FATALITIES and I don't know how can I fix it. >> > >> > I tried with character set images used in spa training but I also had >> > errors. >> > >> > Somebody can give me a simple example step by step to train tesseract >> for >> > specific charset? >> > >> > Thanks in advance, >> > Esteban. >> > >> > -- >> > You received this message because you are subscribed to the Google >> > Groups "tesseract-ocr" group. >> > To post to this group, send email to [email protected] >> > To unsubscribe from this group, send email to >> > [email protected] >> > For more options, visit this group at >> > http://groups.google.com/group/tesseract-ocr?hl=en >> > >> > > -- > You received this message because you are subscribed to the Google > Groups "tesseract-ocr" group. > To post to this group, send email to [email protected] > To unsubscribe from this group, send email to > [email protected] > For more options, visit this group at > http://groups.google.com/group/tesseract-ocr?hl=en > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To post to this group, send email to [email protected] To unsubscribe from this group, send email to [email protected] For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en

