Viranga Ratnaike wrote:
>I was hunting for examples of japanese xml and came across the
>following, which looks rather cool.  Except that it doesn't seem
>to actually be unicode.  I thought XML had mandated unicode?
> http://java.sun.com/xml/jaxp-1.1/examples/samples/weekly-euc-jp.xml

Not at all! Any encoding can be used in XML documents. The requirement is
that the encoding is declared inside each document.

In fact, that document begins with:

        <?xml version="1.0" encoding="euc-jp" ?> 

"euc-jp" means the Japanese character set (JIS) serialized in EUC ("Extended
Unix Code"). EUC is what Unicoders would call a "transformation format", and
it is very popular with the three main CJK character sets (JIS=Japan,
GB=China, KCS=Korea).

_ Marco

Reply via email to