On 01/14/2011 02:37 PM, Steven Schveighoffer wrote:

* I don't even know how to make a grapheme that is more than one
code-unit, let alone more than one code-point :)  Every time I try, I
get 'invalid utf sequence'.

I feel significantly ignorant on this issue, and I'm slowly getting
enough knowledge to join the discussion, but being a dumb American who
only speaks English, I have a hard time grasping how this shit all works.

1. See my text at https://bitbucket.org/denispir/denispir-d/src/c572ccaefa33/U%20missing%20level%20of%20abstraction

2.
    writeln ("A\u0308\u0330");
<A + tilde above + umlaut below> (or the opposite)
If it does not display properly, either set your terminal to UTF* or use a more unicode-aware font (eg DejaVu series).

The point is not playing like that with Unicode flexibility. Rather that composite characters are just normal thingies in most languages of the world. Actually, on this point, english is a rare exception (discarding letters imported from foreign languages like french 'à'); to the point of beeing, I guess, the only western language without any diacritic.


Denis
_________________
vita es estrany
spir.wikidot.com

Reply via email to