what OS you use and which tesseract version? Zdenko
PS: it worked on windows XP with tesseract 3.00 On Tue, Jun 21, 2011 at 3:17 PM, Esteban Bordón <[email protected]> wrote: > Sorry, I forgot attach it. Anyway font_properties is used from v 3.01 and I > am using v 3.00 > > cheers, > Esteban. > > 2011/6/21 zdenko podobny <[email protected]> > >> If you got error on font_properties file, send also font_properties ;-) >> >> Zdenko >> >> On Tue, Jun 21, 2011 at 2:45 PM, Esteban Bordón <[email protected]>wrote: >> >>> For example using these files provides in >>> http://tesseract-ocr.googlecode.com/files/boxtiff-2.01.spa.tar.gz and >>> the command lines bellow >>> >>> *]$ tesseract spa.cour.g4.tif spa.cour.g4 nobatch box.train >>> ]$ unicharset_extractor spa.cour.g4.box* >>> >>> These commands work ok but I don't know how I must continue >>> If I run: >>> *]$ mftraining -F font_properties -U unicharset spa.cour.g4.tr* >>> I get >>> *Reading spa.cour.g4.tr ... >>> spa.cour.g4 has no defined properties. >>> >>> Error: Illegal short name for a feature! >>> >>> Fatal error: No error trap defined! >>> Signal_termination_handler called with signal 2000* >>> >>> Now I'm trying in tesseract 3.00, then I can't use font_properties: >>> *[ebordon@ebordon ]$ mftraining -U unicharset spa.cour.g4.tr Reading >>> spa.cour.g4.tr ... >>> spa.cour.g4 has no defined properties. >>> >>> Error: Illegal short name for a feature! >>> >>> Fatal error: No error trap defined! >>> Signal_termination_handler called with signal 2000 >>> * >>> and: >>> >>> *[ebordon@ebordon ]$ cntraining spa.cour.g4.tr >>> Reading spa.cour.g4.tr ... >>> >>> Error: Illegal short name for a feature! >>> >>> Fatal error: No error trap defined! >>> Signal_termination_handler called with signal 2000 >>> * >>> Thanks, >>> Esteban. >>> >>> >>> >>> 2011/6/20 Dmitri Silaev <[email protected]> >>> >>>> You have to show us your training images, resulted box files and all >>>> used command lines. >>>> >>>> Warm regards, >>>> Dmitri Silaev >>>> www.CustomOCR.com >>>> >>>> >>>> >>>> >>>> >>>> On Mon, Jun 20, 2011 at 8:04 PM, Esteban Bordón <[email protected]> >>>> wrote: >>>> > Hi all! >>>> > >>>> > I'm working on a project that wants to digitize judicial expedients. >>>> We want >>>> > to use tesseract but we haven't had great results. >>>> > I think that if I train tesseract very specifically for the kind of >>>> font >>>> > that the expedients uses we could increase the positive results but I >>>> > couldn't trained my character set. >>>> > I have installed tesseract 3.01 in Ubuntu 11.04 and I followed the >>>> > instructions posted on >>>> > http://code.google.com/p/tesseract-ocr/wiki/TrainingTesseract3. >>>> > In the step >>>> > >>>> http://code.google.com/p/tesseract-ocr/wiki/TrainingTesseract3#Run_Tesseract_for_Training >>>> > I've got many FATALITIES and I don't know how can I fix it. >>>> > >>>> > I tried with character set images used in spa training but I also had >>>> > errors. >>>> > >>>> > Somebody can give me a simple example step by step to train tesseract >>>> for >>>> > specific charset? >>>> > >>>> > Thanks in advance, >>>> > Esteban. >>>> > >>>> > -- >>>> > You received this message because you are subscribed to the Google >>>> > Groups "tesseract-ocr" group. >>>> > To post to this group, send email to [email protected] >>>> > To unsubscribe from this group, send email to >>>> > [email protected] >>>> > For more options, visit this group at >>>> > http://groups.google.com/group/tesseract-ocr?hl=en >>>> > >>>> >>> >>> -- >>> You received this message because you are subscribed to the Google >>> Groups "tesseract-ocr" group. >>> To post to this group, send email to [email protected] >>> To unsubscribe from this group, send email to >>> [email protected] >>> For more options, visit this group at >>> http://groups.google.com/group/tesseract-ocr?hl=en >>> >> >> -- >> You received this message because you are subscribed to the Google >> Groups "tesseract-ocr" group. >> To post to this group, send email to [email protected] >> To unsubscribe from this group, send email to >> [email protected] >> For more options, visit this group at >> http://groups.google.com/group/tesseract-ocr?hl=en >> > > -- > You received this message because you are subscribed to the Google > Groups "tesseract-ocr" group. > To post to this group, send email to [email protected] > To unsubscribe from this group, send email to > [email protected] > For more options, visit this group at > http://groups.google.com/group/tesseract-ocr?hl=en > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To post to this group, send email to [email protected] To unsubscribe from this group, send email to [email protected] For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en

