The script is tesstrain.sh, not tesseract.sh.

Sorry about the typo in my earlier message (autocorrect problem on my phone).
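
If you would rather stay with plain text2image, the per-font loop asked about in the quoted question could be sketched roughly as below. This is only an illustration, not what tesstrain.sh does internally. It assumes a font list file with one font name per line (the npn.fontlist.txt produced by --find_fonts may need reformatting first), and the function name render_fonts and the TEXT2IMAGE and DRY_RUN variables are hypothetical helpers, not part of Tesseract. The text2image flags are the ones already used in the quoted command.

```shell
# Sketch: render one .tif/.box pair per font by calling text2image in a loop.
# Assumption: the font list file holds exactly one font name per line.
# TEXT2IMAGE and DRY_RUN are hypothetical variables used only by this sketch.
TEXT2IMAGE="${TEXT2IMAGE:-training/text2image}"

render_fonts() {
  rf_fontlist="$1"   # file with one font name per line (assumed format)
  rf_text="$2"       # training text, e.g. npn_training_text.txt
  rf_lang="$3"       # language prefix for the output files, e.g. npn
  rf_fonts_dir="$4"  # directory containing the fonts

  while IFS= read -r rf_font || [ -n "$rf_font" ]; do
    # Skip empty lines in the font list.
    [ -z "$rf_font" ] && continue

    # Replace spaces in the font name so the output base has no spaces.
    rf_base="${rf_lang}.$(printf '%s' "$rf_font" | tr ' ' '_').exp0"

    if [ -n "${DRY_RUN:-}" ]; then
      # Only print the command that would be run.
      echo "$TEXT2IMAGE --text=$rf_text --outputbase=$rf_base --font=$rf_font --fonts_dir=$rf_fonts_dir"
    else
      # stdin is redirected so the command cannot consume the font list.
      "$TEXT2IMAGE" --text="$rf_text" --outputbase="$rf_base" \
        --font="$rf_font" --fonts_dir="$rf_fonts_dir" </dev/null
    fi
  done < "$rf_fontlist"
}
```

For example, `DRY_RUN=1 render_fonts fonts.txt npn_training_text.txt npn /usr/share/fonts` prints the commands without running them; drop DRY_RUN=1 to actually render.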

On 31-Aug-2017 9:41 PM, "ShreeDevi Kumar" <[email protected]> wrote:

> You can change the fontlist either in language-specific.sh or as a
> parameter when you run tesstrain.sh
>
> Read the wiki pages regarding training for more info.
>
> On 31-Aug-2017 9:38 PM, "ShreeDevi Kumar" <[email protected]> wrote:
>
>> Please see tesseract.sh script file in training directory.
>>
>> It automates the whole training process.
>>
>> On 31-Aug-2017 9:29 PM, "Dan9er" <[email protected]> wrote:
>>
>>> Running
>>> training/text2image --text=npn_training_text.txt --outputbase=npn.Exo.exp0
>>> --font='Exo' --fonts_dir=/usr/share/fonts
>>>
>>> gives the desired output of two files:
>>>
>>>    - npn.Exo.exp0.tif
>>>    - npn.Exo.exp0.box
>>>
>>> But running this command for the 162 fonts I want to use is very time
>>> consuming and monotonous. I tried running this command:
>>> training/text2image --text=npn_training_text.txt --outputbase=npn
>>> --fonts_dir=/usr/share/fonts --find_fonts --min_coverage=1.0
>>> --render_per_font=true
>>>
>>> But that only made files in this format: npn.{fontName}.tif
>>>
>>> *How do I automate making .tif AND .box files?* Do I have to change the
>>> --outputbase to something different or do I have to make a .sh script?
>>>
>>> PS. I did run training/text2image --find_fonts with --render_per_font
>>> set to false, so I have a npn.fontlist.txt file on hand.
>>>
>>> --
>>> You received this message because you are subscribed to the Google
>>> Groups "tesseract-ocr" group.
>>> To unsubscribe from this group and stop receiving emails from it, send
>>> an email to [email protected].
>>> To post to this group, send email to [email protected].
>>> Visit this group at https://groups.google.com/group/tesseract-ocr.
>>> To view this discussion on the web visit
>>> https://groups.google.com/d/msgid/tesseract-ocr/9d7df5ab-e1ad-43a6-9d7b-d7ba4ef39951%40googlegroups.com.
>>> For more options, visit https://groups.google.com/d/optout.
>>>
>>
