Nick Sabalausky:

> 3. A text editor, for instance, is intended to treat something like (U+305D, 
> U+3099) as a single character, right?

Languages are a product of biology, and in biology it's usually hard to put 
absolute limits between things; all definitions must be flexible and a little 
fuzzy if they want to grasp enough of the reality and be useful. So I think the 
answer to this question is positive.
When you iterate with D foreach on a string that contains those, what is the 
right way to split chars? Returning a single "char" 8 bytes long (that is a 
string of two 32-bit chars) that contains them both is not wrong (but probably 
not expected) :-)

Bye,
bearophile

Reply via email to