Hello!
I have a question regarding some particular Japanese words.

I have a MySQL database and have set the character set for selected
tables to sjis (I have also tried this with ujis and various php
settings)

php.ini settings for mbstring are as follows:

     mbstring.language=Japanese
     mbstring.internal_encoding=SJIS
     mbstring.http_input=auto
     mbstring.http_input=UTF-8
     mbstring.encoding_translation=on
     mbstring.detect_order=auto
     mbstring.substitute_character=none

[the server was configured by our network person so i am not 100% sure
of the reasoning behind these particular settings.]

The server is Apache using an additional module for Japanese from webDAV
called mod_encoding
with ServerEncoding set to utf-8 & DefaultClientEncoding as
JA-AUTO-SJIS-MS

Each page is encoded as shift_jis. (i have also played around with these
and the get the same problems if this is changes to utf-8 or euc-jp)

Generally, the html pages display Japanese without problem, however
there are a couple of particular kanji that do not display properly.
When entered into a html form they look fine, when the query the data is
used for is echoed back to the screen all is fine, but once they are
actually entered into the MySQL database they change.

Examples of problem kanji...
yo-so-ku -> when this is stored in the database it changes to the kanji
for egg and a small katakana i...
yo-so-u  -> when this is stored in the database it changes to the kanji
for egg and a small z...
also
ko-u-chi-ku
ju-u-bu-n
hyo-u-to
hyo-u-sho-u

Can this be changed by altering the encoding scheme chosen or are these
particular problem kanji and should just be avoided?


I would really appreciate your expertise in helping me make sense of
this.
Thanks,
Claire

Reply via email to