Yes .I tried tessdata - kor.trainnedata /// But it is not good enough. sorry .ㅜㅜ i can not use tesseract 4.0 tessdata-kor.trainnedata. in bussiness ..
So I must train 4.00 kor ... Thank you for advice 2018년 3월 1일 목요일 오후 12시 59분 31초 UTC+9, shree 님의 말: > > > On Thu, Mar 1, 2018 at 9:21 AM, 이경준 <[email protected] <javascript:>> > wrote: > >> Thank U reply my question. >> >> But my system is operated by Ubuntu 16.04. 03 LTS >> >> I think that that path is not working ? Am I false? >> >> >> 2018년 2월 28일 수요일 오후 6시 18분 41초 UTC+9, shree 님의 말: >>> >>> Try with following - make sure that you change all variables with dir to >>> match your setup >>> >>> tesstrain.sh \ >>> --lang kor \ >>> --noextract_font_properties \ >>> --linedata_only \ >>> * --langdata_dir ../langdata \* >>> * --tessdata_dir ../tessdata \* >>> * --fonts_dir **/mnt/c/Windows/Fonts** \* >>> --fontlist \ >>> "Arial Unicode MS" \ >>> * --output_dir ../tesstutorial/kor* >>> >>> The fontlist you specify in command will override the list in >>> language_specific.sh >>> >>> >>> > Tesseract4.00alpha gives good results for Korean recognition. Have you > tried that? You may not need to do training. > > If you want to do training for 4.00, you need files from langdata and > tessdata_best. > > https://github.com/tesseract-ocr/langdata > https://github.com/tesseract-ocr/ tessdata_best > > see https://github.com/tesseract-ocr/langdata/blob/master/README.md > > > > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To post to this group, send email to [email protected]. Visit this group at https://groups.google.com/group/tesseract-ocr. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/88825700-8da1-4fc7-be6e-1bccdf0848d5%40googlegroups.com. For more options, visit https://groups.google.com/d/optout.

