Hello,

Please see this bug report and the suggested solution:
https://github.com/tesseract-ocr/tesseract/issues/1252

I suspect the problem is in Pango, but we would like to test it. Are you able to
create a simple test case (provide a small chi_sim.txt and share the font, if
possible) for this issue?
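
If it helps, one way to cut a large training text down to a small sample is
sketched below. This is only an illustration: the function name
make_small_sample, the output file name chi_sim_small.txt and the 20-line
limit are assumptions made for the example, not anything text2image requires.

```python
from pathlib import Path


def make_small_sample(src, dst, max_lines=20):
    """Copy the first `max_lines` non-empty lines of `src` to `dst` as UTF-8.

    Returns the number of lines written.
    """
    kept = []
    with open(src, encoding="utf-8") as f:
        for line in f:
            if line.strip():                 # skip blank lines
                kept.append(line.rstrip("\n"))
            if len(kept) >= max_lines:       # stop once the sample is big enough
                break
    Path(dst).write_text("".join(l + "\n" for l in kept), encoding="utf-8")
    return len(kept)


# Hypothetical usage (file names are placeholders):
# make_small_sample("chi_sim.txt", "chi_sim_small.txt", max_lines=20)
```

A sample of a few lines is usually enough to show whether the font name is
resolved at all, and it keeps the attachment small.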

Zdenko


On Tue, 6 Nov 2018 at 10:56, bruce <[email protected]> wrote:

> I use the command as follows to find the fonts I can use to train my
> language.
> *text2image.exe --text=chi_sim.txt --outputbase=chi_sim.庞中华行书.exp0
> --fonts_dir=C:\Windows\Fonts --find_fonts*
> and I got the following result:
>     Font MStiffHeiPRC failed with 414359 hits = 100.00%
>     Font MStiffHeiPRC failed with 414359 hits = 100.00%
>     Font MStiffHeiPRC failed with 414359 hits = 100.00%
>     Font MStiffHeiPRC failed with 414359 hits = 100.00%
>     Font MStream PRC failed with 414359 hits = 100.00%
>     Font MSung PRC failed with 414359 hits = 100.00%
>     Font MSung PRC failed with 414359 hits = 100.00%
>     庞中华行书 Light : 414361 hits = 100.00%, raw = 3440 = 100.00%
>     Font 剑客毛笔行书 failed with 414357 hits = 100.00%
>     Font 可可漫雪体 failed with 414360 hits = 100.00%
>     Font 多米手写体 failed with 414253 hits = 99.97%
>     Font 字体中国-锐博体V1 failed with 414359 hits = 100.00%
>     Font 孙运和酷楷 failed with 414359 hits = 100.00%
>     Font 建刚静心楷 failed with 414359 hits = 100.00%
>     Font 张维镜手写楷书 Medium failed with 410014 hits = 98.95%
>     Font 徐金如硬笔行楷X failed with 413042 hits = 99.68%
>
>
>
> Then I used a command like this:
> *text2image.exe --text=chi_sim.txt --outputbase=chi_sim.庞中华行书.exp0
> --ptsize 36 --font "庞中华行书" --fonts_dir C:\Windows\Fonts*
> I got the following error:
>     Could not find font named '庞中华行书'.
>     Pango suggested font 'MingLiU'.
>     Please correct --font arg.
>
> Does text2image not support fonts with Chinese names? How can I use these
> fonts?
>
> --
> You received this message because you are subscribed to the Google Groups
> "tesseract-ocr" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to [email protected].
> To post to this group, send email to [email protected].
> Visit this group at https://groups.google.com/group/tesseract-ocr.
> To view this discussion on the web visit
> https://groups.google.com/d/msgid/tesseract-ocr/a9a31397-9196-4923-aa79-43d151d534a1%40googlegroups.com
> For more options, visit https://groups.google.com/d/optout.
>

