On Friday, 7 March 2014 at 04:11:15 UTC, Nick Sabalausky wrote:
What about this?:

Anywhere we currently have a front() that decodes, such as your example:

@property dchar front(T)(T[] a) @safe pure if (isNarrowString!(T[]))
  {
assert(a.length, "Attempting to fetch the front of an empty array
of " ~
           T.stringof);
    size_t i = 0;
    return decode(a, i);
  }


We rip out that front() entirely. The result is *not* technically a range...yet! We could call it a protorange.

Then we provide two functions:

auto decode(someStringProtoRange) {...}
auto raw(someStringProtoRange) {...}

These convert the protoranges into actual ranges by adding the missing front() function. The 'decode' adds a front() which decodes into dchar, while the 'raw' adds a front() which simply returns the raw underlying type.

I imagine the decode/raw would probably also handle any "length" property (if it exists in the protorange) accordingly.

This way, the user is forced to specify "myStringRange.decode" or "myStringRange.raw" as appropriate, otherwise myStringRange can't be used since it isn't technically a range, only a protorange.

(Naturally, ranges of dchar would always have front, since no decoding is ever needed for them anyway. For these ranges, the decode/raw funcs above would simply be no-ops.)

Strings can be iterated over by code unit, code point, grapheme, grapheme cluster (?), words, sentences, lines, paragraphs, and potentially other things. Therefore, it makes sense two require the same for ranges of dchar, too.

Also, `byCodeUnit` and `byCodePoint` would probably be better names than `raw` and `decode`, to much the already existing `byGrapheme` in std.uni.

Reply via email to