I have tried with tesseract 3.00 in Fedora 14 and Ubuntu 11.04.

Which commands you have used?

Maybe I must to try on XP or Windows 7...

2011/6/21 zdenko podobny <[email protected]>

> what OS you use and which tesseract version?
>
> Zdenko
>
> PS: it worked on windows XP with tesseract 3.00
>
> On Tue, Jun 21, 2011 at 3:17 PM, Esteban Bordón <[email protected]> wrote:
>
>> Sorry, I forgot attach it. Anyway font_properties is used from v 3.01 and
>> I am using v 3.00
>>
>> cheers,
>> Esteban.
>>
>> 2011/6/21 zdenko podobny <[email protected]>
>>
>>> If you got error on font_properties file, send also font_properties  ;-)
>>>
>>> Zdenko
>>>
>>> On Tue, Jun 21, 2011 at 2:45 PM, Esteban Bordón <[email protected]>wrote:
>>>
>>>> For example using these files provides in
>>>> http://tesseract-ocr.googlecode.com/files/boxtiff-2.01.spa.tar.gz and
>>>> the command lines bellow
>>>>
>>>> *]$ tesseract spa.cour.g4.tif spa.cour.g4 nobatch box.train
>>>> ]$ unicharset_extractor spa.cour.g4.box*
>>>>
>>>> These commands work ok but I don't know how I must continue
>>>> If I run:
>>>> *]$ mftraining -F font_properties -U unicharset spa.cour.g4.tr*
>>>> I get
>>>> *Reading spa.cour.g4.tr ...
>>>> spa.cour.g4 has no defined properties.
>>>>
>>>> Error: Illegal short name for a feature!
>>>>
>>>> Fatal error: No error trap defined!
>>>> Signal_termination_handler called with signal 2000*
>>>>
>>>> Now I'm trying in tesseract 3.00, then I can't use font_properties:
>>>> *[ebordon@ebordon ]$ mftraining -U unicharset spa.cour.g4.tr Reading
>>>> spa.cour.g4.tr ...
>>>> spa.cour.g4 has no defined properties.
>>>>
>>>> Error: Illegal short name for a feature!
>>>>
>>>> Fatal error: No error trap defined!
>>>> Signal_termination_handler called with signal 2000
>>>> *
>>>> and:
>>>>
>>>> *[ebordon@ebordon ]$ cntraining spa.cour.g4.tr
>>>> Reading spa.cour.g4.tr ...
>>>>
>>>> Error: Illegal short name for a feature!
>>>>
>>>> Fatal error: No error trap defined!
>>>> Signal_termination_handler called with signal 2000
>>>> *
>>>> Thanks,
>>>> Esteban.
>>>>
>>>>
>>>>
>>>> 2011/6/20 Dmitri Silaev <[email protected]>
>>>>
>>>>> You have to show us your training images, resulted box files and all
>>>>> used command lines.
>>>>>
>>>>> Warm regards,
>>>>> Dmitri Silaev
>>>>> www.CustomOCR.com
>>>>>
>>>>>
>>>>>
>>>>>
>>>>>
>>>>> On Mon, Jun 20, 2011 at 8:04 PM, Esteban Bordón <[email protected]>
>>>>> wrote:
>>>>> > Hi all!
>>>>> >
>>>>> > I'm working on a project that wants to digitize judicial expedients.
>>>>> We want
>>>>> > to use tesseract but we haven't had great results.
>>>>> > I think that if I train tesseract very specifically for the kind of
>>>>> font
>>>>> > that the expedients uses we could increase the positive results but I
>>>>> > couldn't trained my character set.
>>>>> > I have installed tesseract 3.01 in Ubuntu 11.04 and I followed the
>>>>> > instructions posted on
>>>>> > http://code.google.com/p/tesseract-ocr/wiki/TrainingTesseract3.
>>>>> > In the step
>>>>> >
>>>>> http://code.google.com/p/tesseract-ocr/wiki/TrainingTesseract3#Run_Tesseract_for_Training
>>>>> > I've got many FATALITIES and I don't know how can I fix it.
>>>>> >
>>>>> > I tried with character set images used in spa training but I also had
>>>>> > errors.
>>>>> >
>>>>> > Somebody can give me a simple example step by step to train tesseract
>>>>> for
>>>>> > specific charset?
>>>>> >
>>>>> > Thanks in advance,
>>>>> > Esteban.
>>>>> >
>>>>> > --
>>>>> > You received this message because you are subscribed to the Google
>>>>> > Groups "tesseract-ocr" group.
>>>>> > To post to this group, send email to [email protected]
>>>>> > To unsubscribe from this group, send email to
>>>>> > [email protected]
>>>>> > For more options, visit this group at
>>>>> > http://groups.google.com/group/tesseract-ocr?hl=en
>>>>> >
>>>>>
>>>>
>>>>  --
>>>> You received this message because you are subscribed to the Google
>>>> Groups "tesseract-ocr" group.
>>>> To post to this group, send email to [email protected]
>>>> To unsubscribe from this group, send email to
>>>> [email protected]
>>>> For more options, visit this group at
>>>> http://groups.google.com/group/tesseract-ocr?hl=en
>>>>
>>>
>>>  --
>>> You received this message because you are subscribed to the Google
>>> Groups "tesseract-ocr" group.
>>> To post to this group, send email to [email protected]
>>> To unsubscribe from this group, send email to
>>> [email protected]
>>> For more options, visit this group at
>>> http://groups.google.com/group/tesseract-ocr?hl=en
>>>
>>
>>  --
>> You received this message because you are subscribed to the Google
>> Groups "tesseract-ocr" group.
>> To post to this group, send email to [email protected]
>> To unsubscribe from this group, send email to
>> [email protected]
>> For more options, visit this group at
>> http://groups.google.com/group/tesseract-ocr?hl=en
>>
>
>  --
> You received this message because you are subscribed to the Google
> Groups "tesseract-ocr" group.
> To post to this group, send email to [email protected]
> To unsubscribe from this group, send email to
> [email protected]
> For more options, visit this group at
> http://groups.google.com/group/tesseract-ocr?hl=en
>

-- 
You received this message because you are subscribed to the Google
Groups "tesseract-ocr" group.
To post to this group, send email to [email protected]
To unsubscribe from this group, send email to
[email protected]
For more options, visit this group at
http://groups.google.com/group/tesseract-ocr?hl=en

Reply via email to