Re: "A Programmer's Introduction to Unicode"

Janusz S. Bien Sun, 12 Mar 2017 12:07:12 -0700

Quote/Cytat - Manish Goregaokar <man...@mozilla.com> (Sun 12 Mar 201707:43:22 PM CET):

This is just another confirmation that the present Unicode terminology

is confusing.


I find this to be a symptom of our pedagogy around "characters" in
programming; most folks get taught that characters are bytes are code
points, especially because many languages try to make this the case.
The name "grapheme cluster" could be improved upon, but it's not the
primary source of this confusion.

I agree that it's not the primary source. However the pedagogy dependson the terminology used.

If the basic notion has to be referred in a cumbersome way as"extended grapheme cluster" then it is easier to talk about "Unicodecharacters" despite the fact that they have a rather loose relation toreal-life/user-perceived characters.


Best regards

Janusz

--

Prof. dr hab. Janusz S. Bień - Uniwersytet Warszawski (KatedraLingwistyki Formalnej)

Prof. Janusz S. Bień - University of Warsaw (Formal Linguistics Department)
jsb...@uw.edu.pl, jsb...@mimuw.edu.pl, http://fleksem.klf.uw.edu.pl/~jsbien/

Re: "A Programmer's Introduction to Unicode"

Reply via email to