Re: Updating D beyond Unicode 2.0

Abdulhaq via Digitalmars-d Sun, 23 Sep 2018 12:10:35 -0700

On Saturday, 22 September 2018 at 08:52:32 UTC, Jonathan M Daviswrote:

Honestly, I was horrified to find out that emojis were even inUnicode. It makes no sense whatsover. Emojis are supposed to besequences of characters that can be interepreted as images.Treating them like Unicode symbols is like treating entirewords like Unicode symbols. It's just plain stupid and a clearsign that Unicode has gone completely off the rails (if it wasever on them). Unfortunately, it's the best tool that we havefor the job.

According to the Unicode website,http://unicode.org/standard/WhatIsUnicode.html,

"""

Support of Unicode forms the foundation for the representation oflanguages and symbols in all major operating systems, searchengines, browsers, laptops, and smart phones—plus the Internetand World Wide Web (URLs, HTML, XML, CSS, JSON, etc.)"""


Note, unicode supports symbols, not just characters.

The smiley face symbol predates its ':-)' usage in ascii text,https://www.smithsonianmag.com/arts-culture/who-really-invented-the-smiley-face-2058483/. It's fundamentally a symbol, not a sequence of characters. Therefore it is not unreasonable for it to be encoded with a unicode number. I do agree though, of course, that it would seem bizarre to use an emoji as a D identifier.

The early history of computer science is completely dominated bycultures who use latin script based characters, and hence, quietreasonably, text encoding and its automated visual representationby compute based devices is dominated by the requirements oflatin script languages. However, the world keeps turning and,despite DT's best efforts, China et al. look to become dominant.Even if not China, the chances are that eventually a non-latinscript based language will become very important. Parochial viewslike "all open source code should be in ASCII" will look silly.

However, until that time D developers have to spend their timewhere it can be most useful. Hence the condition of whether toapply Neia's patch / ideas or not mainly depends on how mucheffort the donwstream effort will be (debuggers etc. as Walterpointed out), and how much the gain is. As unicode 2.0 is alreadysupported I would take a guess that the vast majority of peoplewith access to a computer can already enter identifiers in D thatare rich enough for them. As Adam said though, it would be a goodidea to at least ask!

Re: Updating D beyond Unicode 2.0

Reply via email to