In HTML or XML, a numeric character reference always uses the code point (that
is, the UTF-32 value), never a series of code units (UTF-8 or UTF-16). Thus you
would use:

&#x10123;

not &#xD800;&#xDD23; from UTF-16

nor &#xF0;&#x90;&#x84;&#xA3; from UTF-8
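The distinction can be sketched in a few lines of Python (my illustration, not
part of the original message), using U+10123 as the example character: only the
reference built from the code point itself is a valid HTML/XML character
reference; the forms built from UTF-16 or UTF-8 code units are not.

```python
cp = 0x10123  # example supplementary character

# Correct: one reference built from the code point (the UTF-32 value).
print(f"&#x{cp:X};")  # &#x10123;

# Wrong: two references built from the UTF-16 surrogate pair.
high = 0xD800 + ((cp - 0x10000) >> 10)
low = 0xDC00 + ((cp - 0x10000) & 0x3FF)
print(f"&#x{high:X};&#x{low:X};")  # &#xD800;&#xDD23;

# Wrong: four references built from the UTF-8 bytes.
print("".join(f"&#x{b:X};" for b in chr(cp).encode("utf-8")))
# &#xF0;&#x90;&#x84;&#xA3;
```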

Mark

Brendan Murray/DUB/Lotus wrote:

> How can one encode a surrogate character as an entity in HTML/XML? Should
> it be as two separate characters or as one 32-bit value? In other words
> should it be:
>      &#xABCD;&#xEFGH;
> or
>      &#xABCDEFGH;
>
> Brendan
