A .traineddata file does not store a .font_properties file.
The "font_properties" file is only used as input when building the
"inttemp" file, which is produced by the "mftraining" utility.

What can be extracted from the "traineddata" file is the "inttemp"
component. You can search this file in a binary or hex viewer for a
common font name such as "Arial", as Zdenko once noted. Near the match
you will see the list of the other font names.

You can also skip extracting the components and search the
"traineddata" file for a font name directly, since it stores its
components uncompressed. Be aware that the font names listed in
"font_properties" and then used to build "traineddata" can be fairly
arbitrary and may not correspond to any real font name: they only
serve to label a set of characters that share some distinguishing
features from the point of view of the "traineddata" creator.
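
If you would rather not use a hex viewer, the same search can be
sketched in a few lines of Python. This is only a rough illustration,
not a parser for the real file layout: it finds the first occurrence
of a known name and lists the printable ASCII runs around it. The
function names, the default file name "eng.traineddata", the search
name "Arial" and the window size are all my own assumptions, and the
output will contain other strings besides font names.

```python
import re


def strings_near(data, needle, window=2048, min_len=4):
    """Return printable ASCII runs found within `window` bytes
    of the first occurrence of `needle` in `data`.

    Returns an empty list if `needle` does not occur at all.
    """
    pos = data.find(needle)
    if pos < 0:
        return []
    start = max(0, pos - window)
    end = min(len(data), pos + len(needle) + window)
    chunk = data[start:end]
    # Runs of at least `min_len` printable ASCII characters (space to ~).
    pattern = re.compile(rb"[\x20-\x7e]{%d,}" % min_len)
    return [m.group().decode("ascii") for m in pattern.finditer(chunk)]


def strings_near_in_file(path="eng.traineddata", needle=b"Arial"):
    """Read the whole file (hypothetical default path) and search it."""
    with open(path, "rb") as f:
        return strings_near(f.read(), needle)
```

Calling strings_near_in_file() on an unpacked "inttemp" or directly on
the "traineddata" file should print candidates to look through by eye;
widen the window if the list seems to be cut off.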

Warm regards,
Dmitri Silaev
www.CustomOCR.com

On Wed, Aug 10, 2011 at 3:50 AM, Quan Nguyen <[email protected]> wrote:
> Tess 3.01-version data file includes .font_properties file, which
> defines the fonts used in training. You can unpack .traineddata file
> to get to it.
>
> http://code.google.com/p/tesseract-ocr/wiki/TrainingTesseract3
>
> On Aug 3, 6:51 am, Sandeep Parmar <[email protected]> wrote:
>> Dear all,
>>
>> can anyone tell me which fonts are included in 'eng.traineddata' of
>> tesseract 3.01?
>>
>> Regards
>> Sandeep
>
> --
> You received this message because you are subscribed to the Google
> Groups "tesseract-ocr" group.
> To post to this group, send email to [email protected]
> To unsubscribe from this group, send email to
> [email protected]
> For more options, visit this group at
> http://groups.google.com/group/tesseract-ocr?hl=en
>

