From: "Doug Ewell" <[EMAIL PROTECTED]>
My impression is that Unicode and ISO/IEC 10646 are two distinct
standards, administered respectively by UTC and ISO/IEC JTC1/SC2/WG2,
which have pledged to work together to keep the standards perfectly
aligned and interoperable, because it would be destructiv
;[EMAIL PROTECTED]>; <[EMAIL PROTECTED]>
Sent: Thursday, November 25, 2004 13:59
Subject: Relationship between Unicode and 10646 (was: Re: Shift-JIS
conversion.)
> Philippe Verdy wrote:
>
> >> UTF-8 is an encoding of the Unicode character set.
> >
&g
Philippe Verdy wrote:
>> UTF-8 is an encoding of the Unicode character set.
>
> More exactly, UTF-8 is an encoding of the ISO/IEC 10646 character
> set...
>
> Unicode by itself is not a character set, only an implementation of
> the ISO/IEC 10646 character set...
>
> Of course the Unicode technic
- Original Message -
From: Addison Phillips [wM]
To: pragati ; [EMAIL PROTECTED]
Sent: Thursday, November 25, 2004 6:21 PM
Subject: RE: Shift-JIS conversion.
Dear Pragati,
You can write your own conversion, of course. The mapping tables of
Unicode->SJIS are readily availably. You sho
You just need a mapping table from Unicode
codepoints to Shift-JIS code positions, and a very simple code point parser to
translate UTF-8 into Unicode code points.
You'll find a mapping table in the Unicode UCD, on
its FTP server. The UTF-8 form is fully documented in the Conformance section
ï
Dear Pragati,
You can write your own conversion, of course. The mapping tables of
Unicode->SJIS are readily availably. You should note that there are several
vendor specific variations in the mapping tables. Notably Microsoft code page
932, which is often called Shift-JIS, has more charact
6 matches
Mail list logo