Hi, I believe the real problem might not be a false encoding.
Which pdfviewer did you use to view the PDF? If you are not using the Adobe Reader japanese characters will probably not display because Adobe fonts are used to render japanese. Since you're using a linux machine you probably don't want to use the Adobe Reader. In that case you need to install several fonts as described in the README.txt in mwlib.rl . You can start by only installing "AR PL UMing HK" since this is the only font you need for japanese. Please let me know if that solves your problem. Furthermore any help/hints etc. regarding rendering PDFs in japanese is highly appreciated. If you'd like to contribute (not necessarily code) this would be great ;) Regards, Volker Yoichi KATO wrote: > Hello - > > I've tested mw-render as mentioned in mw-render examples page on my > CentOS box and found that encoding setting for mw-render isn't > correctly handled for Japanese environment. > > Here's what I tried: > > $ mw-render -c :en -w rl -o nasa.pdf NASA > > to get English Wikipedia, which works properly, but for Japanese WP: > > $ mw-render --config=http://ja.wikipedia.org/w/ -L ja -w rl -o > nasa_ja.pdf NASA > > gives me an pdf file with white boxes for Japanese specific > characters. When I cut/paste garbled characters from pdf to text > editor, everything looks fine. > > PDF file property window says it has UniGB-UCS2-H encoding. It is for > Chinise language environment, and shoud be with UniJIS-UCS2-H encoding > (as described in mwlib/reportlab/pdfbase/_cidfontdata.py). > > Can somebody tell how to generate pdf file with a correct Japanese > encoding? > > Cheers, > > > > > -- volker haas brainbot technologies ag fon +49 6131 2116394 boppstraße 64 fax +49 6131 2116392 55118 mainz [email protected] http://www.brainbot.com/ --~--~---------~--~----~------------~-------~--~----~ You received this message because you are subscribed to the Google Groups "mwlib" group. To post to this group, send email to [email protected] To unsubscribe from this group, send email to [email protected] For more options, visit this group at http://groups.google.com/group/mwlib?hl=en -~----------~----~----~----~------~----~------~--~---
