Yes, the doc says that. See cp50221, which adds the kana, and cp50222.

-----Original Message-----
From: Moriyoshi Koizumi [mailto:m...@mozo.jp]
Sent: Wednesday, February 03, 2010 1:47 AM
To: Tex Texin
Cc: KITAZAKI Shigeru; php-i18n@lists.php.net
Subject: Re: [PHP-I18N] adding GB18030 support for mbstring

Note that CP50220 isn't identical to ISO-2022-JP-MS despite the naming.
JISX0201 katakana are not allowed by CP50220.

Moriyoshi

On 2/3/10, Tex Texin <texte...@xencraft.com> wrote:
> Yes, microsoft doc is often loose with respect to encodings.
> E.g. they claim 932 and shift-jis are the same when they aren't, etc.
>
> I'll look for confirmation from Kitazaki-san that
> ISO_2022,locale=ja,version=3 works.
>
> tex
> -----Original Message-----
> From: Moriyoshi Koizumi [mailto:m...@mozo.jp]
> Sent: Tuesday, February 02, 2010 11:32 PM
> To: Tex Texin
> Cc: KITAZAKI Shigeru; php-i18n@lists.php.net
> Subject: Re: [PHP-I18N] adding GB18030 support for mbstring
>
> That is not correct. .NET Names here are also used internally in MS
> products as well as codepages, and don't necessarily reflect the
> actual codeset defined in the IANA charset if the names look the same.
> Look at "additional information" for the differences.
>
> Moriyoshi
>
> On Wed, Feb 3, 2010 at 4:16 PM, Tex Texin <texte...@xencraft.com> wrote:
>> Yes- 50220 is just normal ISO-2022-JP:
>> http://msdn.microsoft.com/en-us/library/dd317756(VS.85).aspx
>>
>>
>> -----Original Message-----
>> From: Moriyoshi Koizumi [mailto:m...@mozo.jp]
>> Sent: Tuesday, February 02, 2010 10:54 PM
>> To: KITAZAKI Shigeru
>> Cc: php-i18n@lists.php.net
>> Subject: Re: [PHP-I18N] adding GB18030 support for mbstring
>>
>> It just turned out ISO_2022,locale=ja,version=3 is actually ISO-2022-JP-MS.
>>
>> Moriyoshi
>>
>> On Wed, Feb 3, 2010 at 10:22 AM, Moriyoshi Koizumi <m...@mozo.jp> wrote:
>>> None of them can handle CP50220.
>>>
>>> Moriyoshi
>>>
>>> 2010/2/3 Tex Texin <texte...@xencraft.com>:
>>>> icu has at least 5 versions of iso 2022-jp.
>>>>
>>>> http://demo.icu-project.org/icu-bin/convexp
>>>>
>>>> If the one you refer to is not one of these send me the details and I'll log
>>>> it with the icu team.
>>>>
>>>> tex
>>>>
>>>>
>>>> -----Original Message-----
>>>> From: KITAZAKI Shigeru [mailto:shigeru_kitaz...@cybozu.co.jp]
>>>> Sent: Tuesday, February 02, 2010 4:43 AM
>>>> To: Moriyoshi Koizumi
>>>> Cc: php-i18n@lists.php.net
>>>> Subject: Re: [PHP-I18N] adding GB18030 support for mbstring
>>>>
>>>> Koizumi-san
>>>>
>>>> Let me tell you one concern about mbstring-ng.
>>>> The current mbstring supports 'ISO-2022-JP-MS', which is different from
>>>> 'ISO-2022-JP'. And the current implementation of ICU cannot convert
>>>> between ISO-2022-JP-MS and Unicode correctly, I guess.
>>>> For example, Japanese hankaku katakana, GA, A with a sonant mark.
>>>>
>>>> Although modifying ICU itself would be the better way, it takes a long time.
>>>> What do you think of this?
>>>>
>>>> Moriyoshi Koizumi wrote:
>>>>> BTW, I created an extension that is near-compatible with mbstring and
>>>>> based on ICU that of course supports GB18030. See
>>>>> http://github.com/moriyoshi/mbstring-ng for detail.
>>>>>
>>>>
>>>> Regards,
>>>> Shigeru
>>>>
>>>> --
>>>> PHP Unicode & I18N Mailing List (http://www.php.net/)
>>>> To unsubscribe, visit: http://www.php.net/unsub.php
>>>>
>>>>
>>>
>>
>> --
>> PHP Unicode & I18N Mailing List (http://www.php.net/)
>> To unsubscribe, visit: http://www.php.net/unsub.php
>>
>>
>
>
--
PHP Unicode & I18N Mailing List (http://www.php.net/)
To unsubscribe, visit: http://www.php.net/unsub.php