Hello - I've tested mw-render as mentioned in mw-render examples page on my CentOS box and found that encoding setting for mw-render isn't correctly handled for Japanese environment.
Here's what I tried: $ mw-render -c :en -w rl -o nasa.pdf NASA to get English Wikipedia, which works properly, but for Japanese WP: $ mw-render --config=http://ja.wikipedia.org/w/ -L ja -w rl -o nasa_ja.pdf NASA gives me an pdf file with white boxes for Japanese specific characters. When I cut/paste garbled characters from pdf to text editor, everything looks fine. PDF file property window says it has UniGB-UCS2-H encoding. It is for Chinise language environment, and shoud be with UniJIS-UCS2-H encoding (as described in mwlib/reportlab/pdfbase/_cidfontdata.py). Can somebody tell how to generate pdf file with a correct Japanese encoding? Cheers, --~--~---------~--~----~------------~-------~--~----~ You received this message because you are subscribed to the Google Groups "mwlib" group. To post to this group, send email to [email protected] To unsubscribe from this group, send email to [email protected] For more options, visit this group at http://groups.google.com/group/mwlib?hl=en -~----------~----~----~----~------~----~------~--~---
