ID: 33793
Updated by: [EMAIL PROTECTED]
Reported By: lars dot jensen at careercross dot com
-Status: Open
+Status: Assigned
Bug Type: mbstring related
Operating System: FreeBSD 5.3
PHP Version: 5.0.3
-Assigned To:
+Assigned To: moriyoshi
New Comment:
Assigned to the maintainer.
Previous Comments:
------------------------------------------------------------------------
[2005-07-21 03:19:47] lars dot jensen at careercross dot com
Description:
------------
Writing a class to handle conversion of UTF-8 input into SJIS using
usual
$body = mb_convert_encoding($body, "SJIS", "UTF-8");
function, usually works, but so far by testing, I identified three
kanji's which makes this function fail to convert these correctly and
thus causing mojibake
The three UTF-8 characters is identified as follows
chr(227).chr(136).chr(177)
chr(227).chr(136).chr(178)
chr(227).chr(136).chr(185)
which in SJIS corresponds to
chr(135).chr(138)
chr(135).chr(139)
chr(135).chr(140)
Reproduce code:
---------------
I created a "quick'n'dirty" solution as follows, which surely isnt
optimal
$body = str_replace(chr(227).chr(136).chr(177), '#mojihack1#', $body);
$body = str_replace(chr(227).chr(136).chr(178), '#mojihack2#', $body);
$body = str_replace(chr(227).chr(136).chr(185), '#mojihack3#', $body);
$body = mb_convert_encoding($body, "SJIS", "UTF-8");
$body = str_replace('#mojihack1#', chr(135).chr(138), $body);
$body = str_replace('#mojihack2#', chr(135).chr(139), $body);
$body = str_replace('#mojihack3#', chr(135).chr(140), $body);
------------------------------------------------------------------------
--
Edit this bug report at http://bugs.php.net/?id=33793&edit=1