Hi ShreeDevi,

Thank you for the quick response.
I modified langdata/jpn/jpn.config and training/language_specific.sh as you suggested. Things went well and no errors occurred. Thanks.

On Monday, April 3, 2017 at 23:37:59 UTC+9, shree wrote:
>
> jpn.config in langdata/jpn is loading jpn_vert as a sublanguage:
>
> tessedit_load_sublangs jpn_vert
>
> You can try without that.
>
> Also look at the settings for jpn in training/language_specific.sh.
>
> You may need to change the following as well:
>
> # The following fonts will be rendered vertically in phase I.
> VERTICAL_FONTS=( \
>     "TakaoExGothic" \ # for jpn
>     "TakaoExMincho" \ # for jpn
>     "AR PL UKai Patched" \ # for chi_tra
>     "AR PL UMing Patched Light" \ # for chi_tra
>     "Baekmuk Batang Patched" \ # for kor
> )
>
> ShreeDevi
> ____________________________________________________________
> भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com
>
> On Mon, Apr 3, 2017 at 4:22 PM, <[email protected]> wrote:
>
>> Hi,
>>
>> I'm trying to create training data for Japanese (jpn.traineddata).
>>
>> I ran 'tesstrain.sh' with the '--linedata_only' option, and the script
>> finished (return code 0). However, the log file contained some error
>> messages (repeated 22 times).
>>
>> ```
>> $ ../tesseract-ocr/training/tesstrain.sh --fonts_dir /usr/share/fonts --lang jpn --linedata_only --noextract_font_properties --langdata_dir ../langdata --tessdata_dir /usr/local/share --output_dir ~/work/jpntrain
>> ```
>>
>> ---
>> [Sun Apr 2 07:42:30 UTC 2017] /usr/local/bin/tesseract /tmp/tmp.pwcwGMb5hs/jpn/jpn.IPAPMincho.exp0.tif /tmp/tmp.pwcwGMb5hs/jpn/jpn.IPAPMincho.exp0 lstm.train ../langdata/jpn/jpn.config
>> [Sun Apr 2 07:42:30 UTC 2017] /usr/local/bin/tesseract /tmp/tmp.pwcwGMb5hs/jpn/jpn.IPAGothic.exp0.tif /tmp/tmp.pwcwGMb5hs/jpn/jpn.IPAGothic.exp0 lstm.train ../langdata/jpn/jpn.config
>> Error opening data file /usr/local/share/tessdata/jpn_vert.traineddata
>> Please make sure the TESSDATA_PREFIX environment variable is set to the
>> parent directory of your "tessdata" directory.
>> Failed loading language 'jpn_vert'
>> ---
>>
>> It seems that 'tesstrain.sh' requires 'jpn_vert.traineddata', but this
>> file is not provided in the tessdata repository.
>>
>> How can I get this file? Or can I substitute 'jpn.traineddata' for
>> 'jpn_vert.traineddata'?
>>
>> I found that there is a 'jpn_vert' directory in the langdata repository,
>> but it contains only some config files.
>>
>> Thanks.
>>
>> --
>> You received this message because you are subscribed to the Google Groups
>> "tesseract-ocr" group.
>> To unsubscribe from this group and stop receiving emails from it, send an
>> email to [email protected].
>> To post to this group, send email to [email protected].
>> Visit this group at https://groups.google.com/group/tesseract-ocr.
>> To view this discussion on the web visit
>> https://groups.google.com/d/msgid/tesseract-ocr/c776398d-0b2f-483d-a9ec-63476eaf0586%40googlegroups.com.
>> For more options, visit https://groups.google.com/d/optout.
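
P.S. In case it helps anyone who hits the same "Failed loading language 'jpn_vert'" error, here is a minimal sketch of the jpn.config change. It is only an illustration under a few assumptions: it works on a throwaway file rather than the real langdata/jpn/jpn.config, the placeholder comment line stands in for whatever other settings the real file has, and deleting the line with sed is just one way to "try without that". The VERTICAL_FONTS edit in training/language_specific.sh is not covered here.

```shell
# Sketch only: operate on a throwaway copy, not on the real langdata checkout.
workdir="$(mktemp -d)"
config="$workdir/jpn.config"

# Hypothetical stand-in for langdata/jpn/jpn.config.
# The first line is a placeholder for the file's other settings.
printf '%s\n' \
  '# placeholder for the other settings in jpn.config' \
  'tessedit_load_sublangs jpn_vert' > "$config"

# Delete the sublanguage line so that jpn_vert.traineddata is no longer requested.
# -i.bak edits in place and keeps the original as jpn.config.bak.
sed -i.bak '/^tessedit_load_sublangs[[:space:]]*jpn_vert/d' "$config"

# Show what is left of the file.
cat "$config"
```

To apply the same idea to a real checkout, point `config` at your own copy of langdata/jpn/jpn.config and keep the .bak file until training has finished cleanly.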

