Re: Proposal for fixing dchar ranges

John Colvin Mon, 10 Mar 2014 13:56:34 -0700

On Monday, 10 March 2014 at 20:00:07 UTC, Steven Schveighoffer wrote:

On Mon, 10 Mar 2014 15:30:00 -0400, John Colvin <[email protected]> wrote:
On Monday, 10 March 2014 at 18:09:51 UTC, Steven Schveighoffer wrote:
Because one can slice out a multi-code-unit code point, one cannot access it via index. Strings would be horribly crippled without slicing. Without indexing, they are fine.
A possibility is to allow indexing, but actually decode the code point at that index (error on invalid index). That might actually be the correct mechanism.
In order to be correct, both require exactly the same knowledge: the beginning of a code point, followed by the end of a code point. In the indexing case they just happen to belong to the same code point and happen to be one code unit from each other. I don't see how one is any more or less error-prone or fundamentally wrong than the other.
Using indexing, you simply cannot get the single code unit that represents a multi-code-unit code point. It doesn't fit in a char. It's guaranteed to fail, whereas slicing will give you access to all the data in the string.

I think I understand your motivation now. Indexing never provides anything that slicing doesn't do more generally.
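
To make that concrete, a minimal sketch of the difference as the language behaves today. The sample string is my own illustration, not something from earlier in the thread, and the code assumes a reasonably recent compiler:

```d
void main()
{
    // 'é' is U+00E9, which UTF-8 encodes as two code units: 0xC3 0xA9.
    string s = "h\u00E9llo";
    assert(s.length == 6);        // length counts code units, not code points

    // Indexing yields exactly one code unit (a char).
    // Here that is only the lead byte of 'é', never the whole code point.
    char unit = s[1];
    assert(unit == 0xC3);

    // Slicing can span every code unit of the code point.
    string e = s[1 .. 3];
    assert(e == "\u00E9");
}
```

The slice needs both boundaries to be right; the index only needs one, but can never return more than a single code unit.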

Now, with indexing actually decoding a code point, one can alias a[i] to a[i..$].front(), which means decode the first code point you come to at index i. This means indexing is slow(er), and returns a dchar. I think as a first step, that might be too much to add silently. I'd rather break it first, then add it back later.
-Steve

Of course that i has to be at the beginning of a code point. Doesn't seem like that useful a feature and potentially very confusing for people who naively expect normal indexing.
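
A rough sketch of what such a decoding index would amount to, assuming it is built on std.utf.decode from current Phobos. The name codePointAt is hypothetical, purely for illustration, and is not part of the proposal:

```d
import std.exception : assertThrown;
import std.range : front;
import std.utf : decode, UTFException;

// Hypothetical helper: decode the code point that starts at code unit index i.
// decode takes the index by ref and advances it; i is a local copy here,
// so the caller's index is left untouched.
dchar codePointAt(string s, size_t i)
{
    return decode(s, i);
}

void main()
{
    string s = "h\u00E9llo";      // 'é' occupies code units 1 and 2

    // Index 1 is the start of 'é', so decoding succeeds and yields a dchar.
    assert(codePointAt(s, 1) == '\u00E9');

    // Equivalent to the a[i..$].front formulation.
    assert(s[1 .. $].front == '\u00E9');

    // Index 2 is a continuation byte, not the start of a code point,
    // so decoding throws instead of returning a character.
    assertThrown!UTFException(codePointAt(s, 2));
}
```

So s[2] would be a runtime error under that scheme, which is exactly what someone expecting ordinary array indexing would trip over.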
