I have tried with tesseract 3.00 on Fedora 14 and Ubuntu 11.04. Which commands did you use?
Maybe I should try on XP or Windows 7...

2011/6/21 zdenko podobny <[email protected]>

> What OS do you use, and which tesseract version?
>
> Zdenko
>
> PS: it worked on Windows XP with tesseract 3.00
>
> On Tue, Jun 21, 2011 at 3:17 PM, Esteban Bordón <[email protected]> wrote:
>
>> Sorry, I forgot to attach it. Anyway, font_properties is used from v 3.01 and
>> I am using v 3.00
>>
>> cheers,
>> Esteban.
>>
>> 2011/6/21 zdenko podobny <[email protected]>
>>
>>> If you got an error on the font_properties file, send font_properties too ;-)
>>>
>>> Zdenko
>>>
>>> On Tue, Jun 21, 2011 at 2:45 PM, Esteban Bordón <[email protected]> wrote:
>>>
>>>> For example, using the files provided in
>>>> http://tesseract-ocr.googlecode.com/files/boxtiff-2.01.spa.tar.gz and
>>>> the command lines below:
>>>>
>>>> ]$ tesseract spa.cour.g4.tif spa.cour.g4 nobatch box.train
>>>> ]$ unicharset_extractor spa.cour.g4.box
>>>>
>>>> These commands work OK, but I don't know how to continue.
>>>> If I run:
>>>> ]$ mftraining -F font_properties -U unicharset spa.cour.g4.tr
>>>> I get:
>>>> Reading spa.cour.g4.tr ...
>>>> spa.cour.g4 has no defined properties.
>>>>
>>>> Error: Illegal short name for a feature!
>>>>
>>>> Fatal error: No error trap defined!
>>>> Signal_termination_handler called with signal 2000
>>>>
>>>> Now I'm trying tesseract 3.00, so I can't use font_properties:
>>>> [ebordon@ebordon ]$ mftraining -U unicharset spa.cour.g4.tr
>>>> Reading spa.cour.g4.tr ...
>>>> spa.cour.g4 has no defined properties.
>>>>
>>>> Error: Illegal short name for a feature!
>>>>
>>>> Fatal error: No error trap defined!
>>>> Signal_termination_handler called with signal 2000
>>>>
>>>> and:
>>>>
>>>> [ebordon@ebordon ]$ cntraining spa.cour.g4.tr
>>>> Reading spa.cour.g4.tr ...
>>>>
>>>> Error: Illegal short name for a feature!
>>>>
>>>> Fatal error: No error trap defined!
>>>> Signal_termination_handler called with signal 2000
>>>>
>>>> Thanks,
>>>> Esteban.
>>>>
>>>> 2011/6/20 Dmitri Silaev <[email protected]>
>>>>
>>>>> You have to show us your training images, the resulting box files and all
>>>>> the command lines you used.
>>>>>
>>>>> Warm regards,
>>>>> Dmitri Silaev
>>>>> www.CustomOCR.com
>>>>>
>>>>> On Mon, Jun 20, 2011 at 8:04 PM, Esteban Bordón <[email protected]> wrote:
>>>>> > Hi all!
>>>>> >
>>>>> > I'm working on a project to digitize judicial expedients. We want
>>>>> > to use tesseract but we haven't had great results.
>>>>> > I think that if I train tesseract specifically for the kind of font
>>>>> > that the expedients use, we could improve the results, but I
>>>>> > couldn't train my character set.
>>>>> > I have installed tesseract 3.01 on Ubuntu 11.04 and followed the
>>>>> > instructions posted at
>>>>> > http://code.google.com/p/tesseract-ocr/wiki/TrainingTesseract3.
>>>>> > In the step
>>>>> > http://code.google.com/p/tesseract-ocr/wiki/TrainingTesseract3#Run_Tesseract_for_Training
>>>>> > I got many FATALITIES and I don't know how to fix them.
>>>>> >
>>>>> > I tried with the character set images used in the spa training but I
>>>>> > also got errors.
>>>>> >
>>>>> > Can somebody give me a simple step-by-step example of training
>>>>> > tesseract for a specific charset?
>>>>> >
>>>>> > Thanks in advance,
>>>>> > Esteban.
>>>>> >
>>>>> > --
>>>>> > You received this message because you are subscribed to the Google
>>>>> > Groups "tesseract-ocr" group.
>>>>> > To post to this group, send email to [email protected]
>>>>> > To unsubscribe from this group, send email to
>>>>> > [email protected]
>>>>> > For more options, visit this group at
>>>>> > http://groups.google.com/group/tesseract-ocr?hl=en
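
For reference, the command sequence quoted in this thread could be collected in a small Python helper, so that the exact lines can be posted or re-run on another OS. This is only a sketch: the names training_commands and as_shell and their parameters are made up for illustration, and the only things taken from the thread are the four command lines themselves. It does not explain or fix the "Illegal short name for a feature!" error.

```python
import shlex


def training_commands(base, use_font_properties=False):
    """Return the training commands quoted in this thread, in order.

    `base` is the shared file stem, e.g. "spa.cour.g4" (hypothetical
    parameter name). Each command is a list of arguments.
    """
    mftraining = ["mftraining"]
    if use_font_properties:
        # Per the thread, font_properties is only used from v 3.01 on.
        mftraining += ["-F", "font_properties"]
    mftraining += ["-U", "unicharset", base + ".tr"]
    return [
        ["tesseract", base + ".tif", base, "nobatch", "box.train"],
        ["unicharset_extractor", base + ".box"],
        mftraining,
        ["cntraining", base + ".tr"],
    ]


def as_shell(commands):
    """Format each argument list as one shell-safe command line."""
    return [" ".join(shlex.quote(arg) for arg in cmd) for cmd in commands]


if __name__ == "__main__":
    # Print the 3.00-style sequence (no font_properties) for spa.cour.g4.
    for line in as_shell(training_commands("spa.cour.g4")):
        print(line)
```

To actually run the steps, each argument list could be passed to subprocess.run(cmd, check=True), which stops at the first command that exits with an error.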

