Re: Range of chars (narrow string ranges)

Jonathan M Davis via Digitalmars-d Fri, 24 Apr 2015 16:36:07 -0700

On Friday, 24 April 2015 at 20:44:34 UTC, Walter Bright wrote:

On 4/24/2015 11:52 AM, H. S. Teoh via Digitalmars-d wrote:
I really wish we would just *make the darn decision* already, whether to kill off autodecoding or not, and MAKE IT CONSISTENT ACROSS PHOBOS, instead of introducing this schizophrenic dichotomy where some functions give you a range of dchar while others give you a range of char/wchar, and the two don't work well together. This is totally going to make a laughing stock of D one day.
Some facts:
1. When I started D, there was a lot of speculation about whether the world would settle on UTF8, UTF16, or UTF32. So D supports natively all three. Time has shown, however, that UTF8 has pretty much won. wchar only exists for Windows API and Java, dchar strings pretty much don't exist in the wild.
2. dchar is very useful as a character type, but not as a string type.
3. Pretty much none of the algorithms in Phobos work when presented with a range of chars or wchars. This is not even documented.
4. Autodecoding is inefficient, especially considering that few algorithms actually need decoding. Re-encoding the result back to UTF8 is another inefficiency.
I'm afraid we are stuck with autodecoding, as taking it out may be far too disruptive.
But all is not lost. The Phobos algorithms can all be fixed to not care about autodecoding. The changes I've made to std.string all reflect that.
https://github.com/D-Programming-Language/phobos/pulls/WalterBright
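
For readers less familiar with the behaviour under discussion, here is a minimal sketch of what autodecoding looks like from user code. It assumes a Phobos recent enough to provide std.utf.byCodeUnit; the sample string is made up for illustration and does not come from the thread.

```d
import std.range : ElementType, walkLength;
import std.utf : byCodeUnit;

void main()
{
    // 'é' (U+00E9) takes two UTF-8 code units, so the array holds 6 chars.
    string s = "h\u00E9llo";

    // As a range, a string yields decoded dchar, not char.
    static assert(is(ElementType!string == dchar));

    assert(s.length == 6);                // code units in the array
    assert(s.walkLength == 5);            // code points, found by decoding
    assert(s.byCodeUnit.walkLength == 6); // opting out of decoding
}
```

The mismatch between s.length and s.walkLength is the kind of inconsistency the thread is arguing about: the same value behaves as an array of char and as a range of dchar.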

I really think that leaving things with autodecoding in some cases and not in others is just asking for trouble. Even if we manage to figure out how to fix it so that Phobos doesn't autodecode in any of its algorithms without breaking any user code in the process, that then leaves user code with the problem, and since Phobos _wouldn't_ have the problem, it then would be all the more confusing.

It _is_ possible to get rid of it entirely without breaking code if we move the array range primitives to a new module and later deprecate the old ones, though that would probably mean breaking up std.array into submodules and deprecating _all_ of it in favor of its submodules, since anyone importing std.array would then have the old array range primitives rather than the new ones - or both, causing conflicts. And it's made worse by the fact that std.range publicly imports std.array. So, yes, it _is_ ugly. But it _can_ be done.
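
To make the idea of replacement primitives concrete, here is a rough sketch of what non-decoding array range primitives could look like. This is only an illustration under my own assumptions, not an actual proposal or existing Phobos code: the functions live directly in the example file rather than in a real module, and countElements is a hypothetical helper. Note that nothing from std.array or std.range is imported, which is exactly the conflict described above.

```d
// Hypothetical non-decoding primitives: every array, narrow strings
// included, is treated as a plain range of its element type.
@property bool empty(T)(in T[] a) { return a.length == 0; }
@property ref T front(T)(T[] a) { return a[0]; }
void popFront(T)(ref T[] a) { a = a[1 .. $]; }

// Hypothetical helper: walks the range using only the primitives above.
size_t countElements(string s)
{
    size_t n = 0;
    for (; !s.empty; s.popFront())
        ++n;
    return n;
}

void main()
{
    string s = "h\u00E9llo"; // 'é' is two UTF-8 code units

    char c = s.front;        // a code unit, not a decoded dchar
    assert(c == 'h');
    assert(countElements(s) == 6);
}
```

With these definitions the range view of a string agrees with its array view (six elements in both cases), and code that wants code points would have to ask for decoding explicitly.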

If we leave autodecoding in and just work around it everywhere in Phobos, it's just going to forever screw with user code and confuse users. They get confused enough by it as it is, and at least now, they're running into it in Phobos where we can explain it, whereas if they don't see it with Phobos and only with their own code, then they're going to think that they're doing something wrong and potentially get very frustrated.

I definitely share the concern that removing autodecoding outright will be too disruptive, but at the same time, I don't know if we can afford to go halfway with it.
