Hello, Please see bug-report and suggested solution: https://github.com/tesseract-ocr/tesseract/issues/1252
I guess problem is in pango, but we would like to test it. Are you able to create simple test case (provide small chi_sim.txt and share font if it is possible) for this issue? Zdenko ut 6. 11. 2018 o 10:56 bruce <[email protected]> napísal(a): > I use the command as follows to find the fonts I can use to train my > language. > *text2image.exe --text=chi_sim.txt --outputbase=chi_sim.庞中华行书.exp0 > --fints_dir=C:\Windows\Fonts --find_fonts* > and i got the result as follows: > Font MStiffHeiPRC failed > with 414359 hits = 100.00% > Font MStiffHeiPRC failed > with 414359 hits = 100.00% > Font MStiffHeiPRC failed > with 414359 hits = 100.00% > Font MStiffHeiPRC failed > with 414359 hits = 100.00% > Font MStream PRC failed > with 414359 hits = 100.00% > Font MSung PRC failed with > 414359 hits = 100.00% > Font MSung PRC failed with > 414359 hits = 100.00% > 庞中华行书 Light : 414361 hits > = 100.00%, raw = 3440 = 100.00% > Font 剑客毛笔行书 failed with > 414357 hits = 100.00% > Font 可可漫雪体 failed with > 414360 hits = 100.00% > Font 多米手写体 failed with > 414253 hits = 99.97% > Font 字体中国-锐博体V1 failed > with 414359 hits = 100.00% > Font 孙运和酷楷 failed with > 414359 hits = 100.00% > Font 建刚静心楷 failed with > 414359 hits = 100.00% > Font 张维镜手写楷书 Medium > failed with 410014 hits = 98.95% > Font 徐金如硬笔行楷X failed with > 413042 hits = 99.68% > > > > Than I use command like this:*text2image.exe --text=chi_sim.txt > --outputbase=chi_sim.庞中华行书.exp0 --ptsize 36 --font "庞中华行书" --fonts_dir > C:\Windows\Fonts* > I got an error resut as follows: > Could not find font named > '庞中华行书'. > Pango suggested font > 'MingLiU'. > Please correct --font arg. > > text2image not support chinese name fonts?How could i use these chinese > name fonts? > > -- > You received this message because you are subscribed to the Google Groups > "tesseract-ocr" group. > To unsubscribe from this group and stop receiving emails from it, send an > email to [email protected]. > To post to this group, send email to [email protected]. > Visit this group at https://groups.google.com/group/tesseract-ocr. > To view this discussion on the web visit > https://groups.google.com/d/msgid/tesseract-ocr/a9a31397-9196-4923-aa79-43d151d534a1%40googlegroups.com > <https://groups.google.com/d/msgid/tesseract-ocr/a9a31397-9196-4923-aa79-43d151d534a1%40googlegroups.com?utm_medium=email&utm_source=footer> > . > For more options, visit https://groups.google.com/d/optout. > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To post to this group, send email to [email protected]. Visit this group at https://groups.google.com/group/tesseract-ocr. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/CAJbzG8w6xBf53V4L86b9%3DyUqvTP%3Dz5VxtmhG3FRtxbr1cEKsHw%40mail.gmail.com. For more options, visit https://groups.google.com/d/optout.

