Re: std.experimental.collections.rcstring and its integration in Phobos

Seb via Digitalmars-d Wed, 18 Jul 2018 05:21:15 -0700

On Tuesday, 17 July 2018 at 18:09:13 UTC, Jonathan M Davis wrote:

On Tuesday, July 17, 2018 17:28:19 Seb via Digitalmars-d wrote:
On Tuesday, 17 July 2018 at 16:58:37 UTC, Jonathan M Daviswrote:
> [...]
Well, there are few cases where the range type doesn't matterand one can simply compare bytes, e.g.
equal (e.g. "ä" == "ä" <=> [195, 164] == [195, 164])
commonPrefix
find
...
That effectively means treating rcstring as a range of char bydefault rather than not treating it as a range by default. Andif we then do that only with functions that overload onrcstring rather than making rcstring actually a range of char,then why aren't we just treating it as a range of char ingeneral?
IMHO, the fact that so many alogorithms currently special-caseon arrays of characters is one reason that auto-decoding hasbeen a disaster, and adding a bunch of overloads for rcstringis just compounding the problem. Algorithms should properlysupport arbitrary ranges of characters, and then rcstring canbe passed to them by calling one of the functions on it to geta range of code units, code points, or graphemes to get anactual range - either that, or rcstring should default to beinga range of char. going halfway and making it work with somefunctions via overloads really doesn't make sense.

Well, the problem of it being a range of char is that this mightlead to very confusing behavior, e.g.


"ä".rcstring.split.join("|") == �|�

So we probably shouldn't go this route either.

The idea of adding overloads was to introduce a bit ofuser-convenience, s.t. they don't have to say


readText("foo".rcstring.by!char)

all the time.

You can still normalize with auto-decoding (the code units -and thus code points - are in a specific order even whenencoded, and that order can be normalized), and really, anyonewho wants fully correct string comparisons needs to benormalizing their strings. With that in mind, rcstring probablyshould support normalization of its internal representation.

It currently doesn't support this out of the box, but it's a veryvalid point and I added it to the list.

Re: std.experimental.collections.rcstring and its integration in Phobos

Reply via email to