Re: Updating D beyond Unicode 2.0

Steven Schveighoffer via Digitalmars-d Mon, 24 Sep 2018 06:50:49 -0700

On 9/22/18 12:56 PM, Neia Neutuladh wrote:

On Saturday, 22 September 2018 at 12:35:27 UTC, Steven Schveighoffer wrote:
But aren't we arguing about the wrong thing here? D already acceptsnon-ASCII identifiers.
Walter was doing that thing that people in the US who only speak Englishtend to do: forgetting that other people speak other languages, and thatpeople who speak English can learn other languages to work with peoplewho don't speak English.

I don't think he was doing that. I think what he was saying was, D triedto accommodate users who don't normally speak English, and they stilluse English (for the most part) for coding.

I'm actually surprised there isn't much code out there that is writtenwith other identifiers besides ASCII, given that C99 supported them. Iassumed it was because they weren't supported. Now I learn that they aresupported, yet almost all C code I've ever seen is written in English.Perhaps that's just because I don't frequent foreign language sitesthough :) But many people here speak English as a second language, andvouch for their cultures still using English to write code.

He was saying it's inevitably a mistake to usenon-ASCII characters in identifiers and that nobody does use them inpractice.

I would expect people probably do try to use them in practice, it's justthat the problems they run into aren't worth the effort(tool/environment support). But I have no first or even second handexperience with this. It does seem like Walter has a lot of experiencewith it though.

Walter talking like that sounds like he'd like to remove support fornon-ASCII identifiers from the language. I've gotten by withoutmaintaining a set of personal patches on top of DMD so far, and I'd likeit if I didn't have to start.

I don't think he was saying that. I think he was against expandingsupport for further Unicode identifiers because the the first effort didnot produce any measurable benefit. I'd be shocked from the recentpositions of Walter and Andrei if they decided to remove non-ASCIIidentifiers that are currently supported, thereby breaking any existingcode.

What languages need an upgrade to unicode symbol names? In otherwords, what symbols aren't possible with the current support?
Chinese and Japanese have gained about eleven thousand symbols sinceUnicode 2.
Unicode 2 covers 25 writing systems, while Unicode 11 covers 146. Justupdating to Unicode 3 would give us Cherokee, Ge'ez (multiplelanguages), Khmer (Cambodian), Mongolian, Burmese, Sinhala (Sri Lanka),Thaana (Maldivian), Canadian aboriginal syllabics, and Yi (Nuosu).

Very interesting! I would agree that we should at least add support forunicode symbols that are used in spoken languages, especially if wealready have support for symbols that aren't ASCII already. I don't seethe downside, especially if you can already use Unicode 2.0 symbols foridentifiers (the ship has already sailed).

It could be a good incentive to get kids in countries where Englishisn't commonly spoken to try D out as a first programming language ;)Using your native language to show example code could be a huge benefitfor teaching coding.

My recommendation is to put the PR up for review (that you said you hadready) and see what happens. Having an actual patch to talk about couldchange minds. At the very least, it's worth not wasting your effortsthat you have already spent. Even if it does need a DIP, the PR can showthat one less piece of effort is needed to get it implemented.


-Steve

Re: Updating D beyond Unicode 2.0

Reply via email to