what OS you use and which tesseract version?

Zdenko

PS: it worked on windows XP with tesseract 3.00

On Tue, Jun 21, 2011 at 3:17 PM, Esteban Bordón <[email protected]> wrote:

> Sorry, I forgot attach it. Anyway font_properties is used from v 3.01 and I
> am using v 3.00
>
> cheers,
> Esteban.
>
> 2011/6/21 zdenko podobny <[email protected]>
>
>> If you got error on font_properties file, send also font_properties  ;-)
>>
>> Zdenko
>>
>> On Tue, Jun 21, 2011 at 2:45 PM, Esteban Bordón <[email protected]>wrote:
>>
>>> For example using these files provides in
>>> http://tesseract-ocr.googlecode.com/files/boxtiff-2.01.spa.tar.gz and
>>> the command lines bellow
>>>
>>> *]$ tesseract spa.cour.g4.tif spa.cour.g4 nobatch box.train
>>> ]$ unicharset_extractor spa.cour.g4.box*
>>>
>>> These commands work ok but I don't know how I must continue
>>> If I run:
>>> *]$ mftraining -F font_properties -U unicharset spa.cour.g4.tr*
>>> I get
>>> *Reading spa.cour.g4.tr ...
>>> spa.cour.g4 has no defined properties.
>>>
>>> Error: Illegal short name for a feature!
>>>
>>> Fatal error: No error trap defined!
>>> Signal_termination_handler called with signal 2000*
>>>
>>> Now I'm trying in tesseract 3.00, then I can't use font_properties:
>>> *[ebordon@ebordon ]$ mftraining -U unicharset spa.cour.g4.tr Reading
>>> spa.cour.g4.tr ...
>>> spa.cour.g4 has no defined properties.
>>>
>>> Error: Illegal short name for a feature!
>>>
>>> Fatal error: No error trap defined!
>>> Signal_termination_handler called with signal 2000
>>> *
>>> and:
>>>
>>> *[ebordon@ebordon ]$ cntraining spa.cour.g4.tr
>>> Reading spa.cour.g4.tr ...
>>>
>>> Error: Illegal short name for a feature!
>>>
>>> Fatal error: No error trap defined!
>>> Signal_termination_handler called with signal 2000
>>> *
>>> Thanks,
>>> Esteban.
>>>
>>>
>>>
>>> 2011/6/20 Dmitri Silaev <[email protected]>
>>>
>>>> You have to show us your training images, resulted box files and all
>>>> used command lines.
>>>>
>>>> Warm regards,
>>>> Dmitri Silaev
>>>> www.CustomOCR.com
>>>>
>>>>
>>>>
>>>>
>>>>
>>>> On Mon, Jun 20, 2011 at 8:04 PM, Esteban Bordón <[email protected]>
>>>> wrote:
>>>> > Hi all!
>>>> >
>>>> > I'm working on a project that wants to digitize judicial expedients.
>>>> We want
>>>> > to use tesseract but we haven't had great results.
>>>> > I think that if I train tesseract very specifically for the kind of
>>>> font
>>>> > that the expedients uses we could increase the positive results but I
>>>> > couldn't trained my character set.
>>>> > I have installed tesseract 3.01 in Ubuntu 11.04 and I followed the
>>>> > instructions posted on
>>>> > http://code.google.com/p/tesseract-ocr/wiki/TrainingTesseract3.
>>>> > In the step
>>>> >
>>>> http://code.google.com/p/tesseract-ocr/wiki/TrainingTesseract3#Run_Tesseract_for_Training
>>>> > I've got many FATALITIES and I don't know how can I fix it.
>>>> >
>>>> > I tried with character set images used in spa training but I also had
>>>> > errors.
>>>> >
>>>> > Somebody can give me a simple example step by step to train tesseract
>>>> for
>>>> > specific charset?
>>>> >
>>>> > Thanks in advance,
>>>> > Esteban.
>>>> >
>>>> > --
>>>> > You received this message because you are subscribed to the Google
>>>> > Groups "tesseract-ocr" group.
>>>> > To post to this group, send email to [email protected]
>>>> > To unsubscribe from this group, send email to
>>>> > [email protected]
>>>> > For more options, visit this group at
>>>> > http://groups.google.com/group/tesseract-ocr?hl=en
>>>> >
>>>>
>>>
>>>  --
>>> You received this message because you are subscribed to the Google
>>> Groups "tesseract-ocr" group.
>>> To post to this group, send email to [email protected]
>>> To unsubscribe from this group, send email to
>>> [email protected]
>>> For more options, visit this group at
>>> http://groups.google.com/group/tesseract-ocr?hl=en
>>>
>>
>>  --
>> You received this message because you are subscribed to the Google
>> Groups "tesseract-ocr" group.
>> To post to this group, send email to [email protected]
>> To unsubscribe from this group, send email to
>> [email protected]
>> For more options, visit this group at
>> http://groups.google.com/group/tesseract-ocr?hl=en
>>
>
>  --
> You received this message because you are subscribed to the Google
> Groups "tesseract-ocr" group.
> To post to this group, send email to [email protected]
> To unsubscribe from this group, send email to
> [email protected]
> For more options, visit this group at
> http://groups.google.com/group/tesseract-ocr?hl=en
>

-- 
You received this message because you are subscribed to the Google
Groups "tesseract-ocr" group.
To post to this group, send email to [email protected]
To unsubscribe from this group, send email to
[email protected]
For more options, visit this group at
http://groups.google.com/group/tesseract-ocr?hl=en

Reply via email to