Moriyoshi Koizumi wrote: > > Japanese developers has continuously been suffering from this moji- > bake issue too and quite a few guys among them don't know even why > moji-bake occurs. So it seems I'm hardly able to explain it in a > nutshell. > > I guess the text I'm mentioning below may help you and so does a book > by the auther. Although the intended audience is a rather expertised > developer, this is worth a look. > > ftp://ftp.ora.com/pub/examples/nutshell/ujip/doc/cjk.inf
Yes, mojibake problems do seem to lead a life of their own - very unpredictable. The article you mention is by Ken Lunde. I've got his book "CJKV Information Processing" and refer to it from time to time. It is, as you say, very specialized, but I find it quite useful, even though I don't understand everything. I've been reading it since last writing to you, and see he refers to EUC as being more stable for Japanese and other East Asian languages. So that's obviously a direction I need to be moving towards. > Microsoft Word is not capable of handling multiple charsets other > than SJIS. If you want to use it with PHP, you should apply encoding > conversion filter such as iconv or nkf to the generated pages before > you get them parsed by php. Right, I'll look into it. > Browsers send the input by the encoding(charset) in which the page > that contains the form is written regardless of the platform they run > on. Ah, that's very useful to know. So presumably material generated in Shift_JIS can be uploaded through a form on EUC-JP page, and it will be converted into EUC-JP. Or have I misunderstood? > I'm going to make a quick webpage about this instead of directing it > as a mail to you. Please wait until it's completed. I'm sure that will be very useful to many people, not just myself. Thank you. > Then, why don't you subscribe the Japanese PHP user list and ask your > question there? The subscribers may lead you to the right direction. > I'll be glad to see you at the list. I might do so, but I'm very busy. Reading Japanese is one thing. Writing a question that will make sense is another. Still, if people like you can write so well in English, I shouldn't be lazy about writing in Japanese either. >> Once I get things sorted out, I would be very happy to assist by >> creating a brief guide in English to setting up PHP to handle >> Japanese. A lot of the current PHP documentation is difficult for >> the non-expert to understand. I review web design books on a regular >> basis, and a leading computer publisher has told me it is very >> interested in including more on i18n and l10n in its publications, >> so this could serve a dual purpose. > > Sounds interesting! I'll drop you a line if I need your assistance. Please do. Thanks for everything, David Powers -- PHP Internationalization Mailing List (http://www.php.net/) To unsubscribe, visit: http://www.php.net/unsub.php