Sorry, I forgot attach it. Anyway font_properties is used from v 3.01 and I am using v 3.00
cheers, Esteban. 2011/6/21 zdenko podobny <[email protected]> > If you got error on font_properties file, send also font_properties ;-) > > Zdenko > > On Tue, Jun 21, 2011 at 2:45 PM, Esteban Bordón <[email protected]> wrote: > >> For example using these files provides in >> http://tesseract-ocr.googlecode.com/files/boxtiff-2.01.spa.tar.gz and the >> command lines bellow >> >> *]$ tesseract spa.cour.g4.tif spa.cour.g4 nobatch box.train >> ]$ unicharset_extractor spa.cour.g4.box* >> >> These commands work ok but I don't know how I must continue >> If I run: >> *]$ mftraining -F font_properties -U unicharset spa.cour.g4.tr* >> I get >> *Reading spa.cour.g4.tr ... >> spa.cour.g4 has no defined properties. >> >> Error: Illegal short name for a feature! >> >> Fatal error: No error trap defined! >> Signal_termination_handler called with signal 2000* >> >> Now I'm trying in tesseract 3.00, then I can't use font_properties: >> *[ebordon@ebordon ]$ mftraining -U unicharset spa.cour.g4.tr Reading >> spa.cour.g4.tr ... >> spa.cour.g4 has no defined properties. >> >> Error: Illegal short name for a feature! >> >> Fatal error: No error trap defined! >> Signal_termination_handler called with signal 2000 >> * >> and: >> >> *[ebordon@ebordon ]$ cntraining spa.cour.g4.tr >> Reading spa.cour.g4.tr ... >> >> Error: Illegal short name for a feature! >> >> Fatal error: No error trap defined! >> Signal_termination_handler called with signal 2000 >> * >> Thanks, >> Esteban. >> >> >> >> 2011/6/20 Dmitri Silaev <[email protected]> >> >>> You have to show us your training images, resulted box files and all >>> used command lines. >>> >>> Warm regards, >>> Dmitri Silaev >>> www.CustomOCR.com >>> >>> >>> >>> >>> >>> On Mon, Jun 20, 2011 at 8:04 PM, Esteban Bordón <[email protected]> >>> wrote: >>> > Hi all! >>> > >>> > I'm working on a project that wants to digitize judicial expedients. We >>> want >>> > to use tesseract but we haven't had great results. >>> > I think that if I train tesseract very specifically for the kind of >>> font >>> > that the expedients uses we could increase the positive results but I >>> > couldn't trained my character set. >>> > I have installed tesseract 3.01 in Ubuntu 11.04 and I followed the >>> > instructions posted on >>> > http://code.google.com/p/tesseract-ocr/wiki/TrainingTesseract3. >>> > In the step >>> > >>> http://code.google.com/p/tesseract-ocr/wiki/TrainingTesseract3#Run_Tesseract_for_Training >>> > I've got many FATALITIES and I don't know how can I fix it. >>> > >>> > I tried with character set images used in spa training but I also had >>> > errors. >>> > >>> > Somebody can give me a simple example step by step to train tesseract >>> for >>> > specific charset? >>> > >>> > Thanks in advance, >>> > Esteban. >>> > >>> > -- >>> > You received this message because you are subscribed to the Google >>> > Groups "tesseract-ocr" group. >>> > To post to this group, send email to [email protected] >>> > To unsubscribe from this group, send email to >>> > [email protected] >>> > For more options, visit this group at >>> > http://groups.google.com/group/tesseract-ocr?hl=en >>> > >>> >> >> -- >> You received this message because you are subscribed to the Google >> Groups "tesseract-ocr" group. >> To post to this group, send email to [email protected] >> To unsubscribe from this group, send email to >> [email protected] >> For more options, visit this group at >> http://groups.google.com/group/tesseract-ocr?hl=en >> > > -- > You received this message because you are subscribed to the Google > Groups "tesseract-ocr" group. > To post to this group, send email to [email protected] > To unsubscribe from this group, send email to > [email protected] > For more options, visit this group at > http://groups.google.com/group/tesseract-ocr?hl=en > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To post to this group, send email to [email protected] To unsubscribe from this group, send email to [email protected] For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en
font_properties
Description: Binary data

