On Mon, Mar 26, 2012 at 2:02 PM, Gavin Barraclough <[email protected]>wrote:

> I really like the direction you're going in, but have one minor concern
> relating to regular expressions.
>
> In your proposal, you currently state:
>        "A code unit that is in the range 0xD800 to 0xDFFF, but is not part
> of a surrogate pair, is interpreted as a code point with the same value."
>

Just as a reminder, this would be in explicit violation of the Unicode
conformance clause C1 unless it can be guaranteed that such a code point
will not be interpreted as an abstract character:

C1 A process shall not interpret a high-surrogate code point or a
low-surrogate code point as an abstract character.

[1] http://www.unicode.org/versions/Unicode6.1.0/ch03.pdf

Given that such guarantee is likely impractical, this presents a problem
for the above proposed language.
_______________________________________________
es-discuss mailing list
[email protected]
https://mail.mozilla.org/listinfo/es-discuss

Reply via email to