Re: Proposal for fixing dchar ranges

monarch_dodra Wed, 12 Mar 2014 02:10:24 -0700

On Tuesday, 11 March 2014 at 18:02:26 UTC, Steven Schveighofferwrote:

No, where we are today is that in some cases, the languagetreats a char[] as an array of char, in other cases, it treatsa char[] as a bi-directional dchar range.
-Steve

I want to mention something I've had trouble with recently, thatI haven't seen mentioned yet, but is related:


The ambiguity of the "lone char".

By that I mean: When a function accepts 'char' as an argument, itis (IMO) very hard to know if it is actually accepting a?

1. An ascii char in the 0 .. 128 range?
2. A code unit?

3. (heaven forbid) a codepoint in the 0 .. 256 range packed intoa char?

Currently (fortuantly? unfortunatly?) the current choice taken inour algorithms is 3, which is actually the 'safest' solution.


So if you write:
find("cassé", cast(char)'é');

It *will* correctly find the 'é', but it *won't* search for it inindividual codeunits.


--------

Another more pernicious case is that of output ranges. "put" issupposed to know how to convert and string/char width, into anysting/char width.

Again, things become funky if you tell "put" to place a string,into a sink that accepts a char.


Is the sink actually telling you to feed it code units? or ascii?

Re: Proposal for fixing dchar ranges

Reply via email to