OK. Thanks for the reply from Shree sincerely. 在 2017年7月26日星期三 UTC+8下午2:48:13,shree写道: > > I do not have this font. > > The training is done at Google. They probably use a number of commercial > fonts in addition to freely available fonts. The fonts are not provided as > part of the training data. > > You have to get your own set of fonts to train or wait for the new > traineddata by Ray (expected in next few weeks). > > ShreeDevi > ____________________________________________________________ > भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com > > On Wed, Jul 26, 2017 at 11:09 AM, <[email protected] <javascript:>> > wrote: > >> Yeah, I know that. But I lack the font of AR PL UMing Patched Light, >> which cannot be found in the Internet. >> >> I'm afraid that I may need to find this package (the font of AR PL UMing >> Patched Light) from you. If you don't mind sharing your resources, thanks >> sincerely. >> >> 在 2017年7月26日星期三 UTC+8上午11:31:23,shree写道: >>> >>> The training process uses the list of fonts from >>> https://github.com/tesseract-ocr/tesseract/blob/master/training/language-specific.sh >>> >>> You need to update it to match the fonts available with you for the >>> script you are training and include the correct location for the fonts >>> directory. >>> >>> ShreeDevi >>> ____________________________________________________________ >>> भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com >>> >>> On Wed, Jul 26, 2017 at 7:17 AM, <[email protected]> wrote: >>> >>>> Hello, >>>> >>>> I'm trying to train my own traineddata with Tess4.0 following the >>>> tutorail: >>>> https://github.com/tesseract-ocr/tesseract/wiki/TrainingTesseract-4.00---Replace-Top-Layer >>>> >>>> When executing the command: >>>> training/tesstrain.sh --fonts_dir /usr/share/fonts --lang chi_sim \ >>>> --training_text ../training_data/part.txt \ >>>> --linedata_only --noextract_font_properties \ >>>> --langdata_dir ../langdata --tessdata_dir ./tessdata \ >>>> --output_dir ~/tesstutorial/chisim >>>> >>>> An error appears: "Could not find font named AR PL UMing Patched >>>> Light", showed in the follow img. >>>> >>>> Then I search for the package of "AR PL UMing Patched Light.ttf" with >>>> Baidu, Google and some other search engines, but cannot find the result. >>>> >>>> Can you help me? I don't know if there are other solutions for this >>>> problem. >>>> >>>> >>>> <https://lh3.googleusercontent.com/-bIQDUqOL0CY/WXfzap2RqGI/AAAAAAAAAAg/9fSidDlSzTEVMsveUmHI__IlwPl4iXXlgCLcBGAs/s1600/Q%2560%255DXS%2524U%255B%2560A%257E1W%2528%2528I4DG%2525AON.png> >>>> >>>> -- >>>> You received this message because you are subscribed to the Google >>>> Groups "tesseract-ocr" group. >>>> To unsubscribe from this group and stop receiving emails from it, send >>>> an email to [email protected]. >>>> To post to this group, send email to [email protected]. >>>> Visit this group at https://groups.google.com/group/tesseract-ocr. >>>> To view this discussion on the web visit >>>> https://groups.google.com/d/msgid/tesseract-ocr/825ee74a-854f-4a46-b911-3e3c6bd56427%40googlegroups.com >>>> >>>> <https://groups.google.com/d/msgid/tesseract-ocr/825ee74a-854f-4a46-b911-3e3c6bd56427%40googlegroups.com?utm_medium=email&utm_source=footer> >>>> . >>>> For more options, visit https://groups.google.com/d/optout. >>>> >>> >>> -- >> You received this message because you are subscribed to the Google Groups >> "tesseract-ocr" group. >> To unsubscribe from this group and stop receiving emails from it, send an >> email to [email protected] <javascript:>. >> To post to this group, send email to [email protected] >> <javascript:>. >> Visit this group at https://groups.google.com/group/tesseract-ocr. >> To view this discussion on the web visit >> https://groups.google.com/d/msgid/tesseract-ocr/bd8a12f7-44e6-4ee2-ab98-cad5506a3091%40googlegroups.com >> >> <https://groups.google.com/d/msgid/tesseract-ocr/bd8a12f7-44e6-4ee2-ab98-cad5506a3091%40googlegroups.com?utm_medium=email&utm_source=footer> >> . >> >> For more options, visit https://groups.google.com/d/optout. >> > >
-- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To post to this group, send email to [email protected]. Visit this group at https://groups.google.com/group/tesseract-ocr. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/ee308604-0f7b-4835-93f7-8db7c2b54435%40googlegroups.com. For more options, visit https://groups.google.com/d/optout.

