Sorry, I forgot to attach it. Anyway, font_properties is only used from v3.01
onwards, and I am using v3.00.

cheers,
Esteban.

2011/6/21 zdenko podobny <[email protected]>

> If you got an error with the font_properties file, send font_properties as well ;-)
>
> Zdenko
>
> On Tue, Jun 21, 2011 at 2:45 PM, Esteban Bordón <[email protected]> wrote:
>
>> For example, using the files provided in
>> http://tesseract-ocr.googlecode.com/files/boxtiff-2.01.spa.tar.gz and the
>> command lines below:
>>
>> ]$ tesseract spa.cour.g4.tif spa.cour.g4 nobatch box.train
>> ]$ unicharset_extractor spa.cour.g4.box
>>
>> These commands work OK, but I don't know how to continue.
>> If I run:
>> ]$ mftraining -F font_properties -U unicharset spa.cour.g4.tr
>> I get:
>> Reading spa.cour.g4.tr ...
>> spa.cour.g4 has no defined properties.
>>
>> Error: Illegal short name for a feature!
>>
>> Fatal error: No error trap defined!
>> Signal_termination_handler called with signal 2000
>>
>> Now I'm trying with tesseract 3.00, so I can't use font_properties:
>> [ebordon@ebordon ]$ mftraining -U unicharset spa.cour.g4.tr
>> Reading spa.cour.g4.tr ...
>> spa.cour.g4 has no defined properties.
>>
>> Error: Illegal short name for a feature!
>>
>> Fatal error: No error trap defined!
>> Signal_termination_handler called with signal 2000
>>
>> and:
>>
>> [ebordon@ebordon ]$ cntraining spa.cour.g4.tr
>> Reading spa.cour.g4.tr ...
>>
>> Error: Illegal short name for a feature!
>>
>> Fatal error: No error trap defined!
>> Signal_termination_handler called with signal 2000
>>
>> Thanks,
>> Esteban.
>>
>>
>>
>> 2011/6/20 Dmitri Silaev <[email protected]>
>>
>>> You have to show us your training images, the resulting box files, and all
>>> the command lines you used.
>>>
>>> Warm regards,
>>> Dmitri Silaev
>>> www.CustomOCR.com
>>>
>>>
>>>
>>>
>>>
>>> On Mon, Jun 20, 2011 at 8:04 PM, Esteban Bordón <[email protected]>
>>> wrote:
>>> > Hi all!
>>> >
>>> > I'm working on a project to digitize judicial case files. We want
>>> > to use tesseract, but we haven't had great results.
>>> > I think that if I train tesseract specifically for the kind of font
>>> > that the case files use, we could improve the results, but I
>>> > couldn't train my character set.
>>> > I have installed tesseract 3.01 on Ubuntu 11.04 and I followed the
>>> > instructions posted at
>>> > http://code.google.com/p/tesseract-ocr/wiki/TrainingTesseract3.
>>> > In the step
>>> > http://code.google.com/p/tesseract-ocr/wiki/TrainingTesseract3#Run_Tesseract_for_Training
>>> > I got many fatal errors and I don't know how to fix them.
>>> >
>>> > I also tried the character set images used for the spa training, but I
>>> > got errors there too.
>>> >
>>> > Can somebody give me a simple step-by-step example of training tesseract
>>> > for a specific character set?
>>> >
>>> > Thanks in advance,
>>> > Esteban.
>>> >
>>> > --
>>> > You received this message because you are subscribed to the Google
>>> > Groups "tesseract-ocr" group.
>>> > To post to this group, send email to [email protected]
>>> > To unsubscribe from this group, send email to
>>> > [email protected]
>>> > For more options, visit this group at
>>> > http://groups.google.com/group/tesseract-ocr?hl=en
>>> >
>>>
>>
>


Attachment: font_properties
Description: Binary data
