The training process uses the list of fonts from
https://github.com/tesseract-ocr/tesseract/blob/master/training/language-specific.sh

You need to update that list to match the fonts installed on your system for
the script you are training, and give the correct location of the fonts directory.
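
To see which fonts are actually installed before editing the list, something
like the sketch below may help. It assumes text2image (built with the
Tesseract training tools) is on your PATH and that the fonts are under
/usr/share/fonts, as in your command. has_font is a hypothetical helper
written for this example, not part of Tesseract.

```shell
#!/bin/sh
# Sketch only: has_font is a hypothetical helper, not part of Tesseract.
# It reads a font listing on stdin and succeeds if any line contains the
# given name (case-insensitive, fixed-string match).
has_font() {
    grep -qiF -- "$1"
}

# Assumed fonts directory, taken from the command in the quoted message.
FONTS_DIR=/usr/share/fonts

# Only query text2image when it is installed.
if command -v text2image >/dev/null 2>&1; then
    # text2image lists the fonts it can see in the given directory.
    if text2image --list_available_fonts --fonts_dir="$FONTS_DIR" 2>&1 |
        has_font "AR PL UMing"; then
        echo "found"
    else
        echo "not found"
    fi
fi
```

If the name in language-specific.sh is not in that listing, replace it with
one that is. tesstrain.sh also takes a --fontlist option, so passing a font
name from the listing there may be enough without editing the file.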

ShreeDevi
____________________________________________________________
भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com

On Wed, Jul 26, 2017 at 7:17 AM, <[email protected]> wrote:

> Hello,
>
> I'm trying to train my own traineddata with Tess4.0 following the tutorial:
> https://github.com/tesseract-ocr/tesseract/wiki/TrainingTesseract-4.00---Replace-Top-Layer
>
> When executing the command:
> training/tesstrain.sh --fonts_dir /usr/share/fonts --lang chi_sim \
> --training_text ../training_data/part.txt \
> --linedata_only --noextract_font_properties \
> --langdata_dir ../langdata --tessdata_dir ./tessdata \
> --output_dir ~/tesstutorial/chisim
>
> An error appears: "Could not find font named AR PL UMing Patched Light",
> as shown in the image below.
>
> I then searched for a package containing "AR PL UMing Patched Light.ttf" with
> Baidu, Google and some other search engines, but could not find one.
>
> Can you help me? I don't know if there are other solutions for this
> problem.
>
>
> <https://lh3.googleusercontent.com/-bIQDUqOL0CY/WXfzap2RqGI/AAAAAAAAAAg/9fSidDlSzTEVMsveUmHI__IlwPl4iXXlgCLcBGAs/s1600/Q%2560%255DXS%2524U%255B%2560A%257E1W%2528%2528I4DG%2525AON.png>
>
> --
> You received this message because you are subscribed to the Google Groups
> "tesseract-ocr" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to [email protected].
> To post to this group, send email to [email protected].
> Visit this group at https://groups.google.com/group/tesseract-ocr.
> To view this discussion on the web visit https://groups.google.com/d/
> msgid/tesseract-ocr/825ee74a-854f-4a46-b911-3e3c6bd56427%
> 40googlegroups.com
> <https://groups.google.com/d/msgid/tesseract-ocr/825ee74a-854f-4a46-b911-3e3c6bd56427%40googlegroups.com?utm_medium=email&utm_source=footer>
> .
> For more options, visit https://groups.google.com/d/optout.
>

