Tomohiro KUBOTA wrote on 2002-04-03 07:51 UTC:
The compatibility characters are there in Unicode to allow you to chose
to either use the unification rules of JIS or the unification rules of
the IRG, at your choice.
For example, IBM's table says that:
SJISUCS
8A436D77 (in CJK
Michael B. Allen wrote on 2002-04-03 08:21 UTC:
I'm assuming you don't have a specific application in mind, since you
didn't answer Markus's question.
Ok, here's an example. The Document Object Model W3C spec describes some
'CharacterData' methods:
Michael B. Allen wrote on 2002-04-03 19:47 UTC:
Whenever you are dealing with a variable-length encoding of characters,
you really don't want to specify anything in terms of a number of
characters.
So you should not use a variable-length encoding for any serious generic
string
Hi folks,
FYI - from Martin Duerst (W3C I18N guru).
Cheers,
- Ira McDonald
High North Inc
-Original Message-
From: Martin Duerst [mailto:[EMAIL PROTECTED]]
Sent: Wednesday, April 03, 2002 2:33 AM
To: [EMAIL PROTECTED]
Subject: charlint updated for Unicode 3.2.0
The perl-based