On 01/01/2012 08:10 AM, Don wrote:
On 31.12.2011 17:13, Timon Gehr wrote:
On 12/31/2011 01:15 PM, Don wrote:
On 31.12.2011 01:56, Timon Gehr wrote:
On 12/31/2011 01:12 AM, Andrei Alexandrescu wrote:
On 12/30/11 6:07 PM, Timon Gehr wrote:
alias std.string.representation raw;

I meant your implementation is incomplete.

It was more a sketch than an implementation. It is not even type safe
:o).


But the main point is that presence of representation/raw is not the
issue.
The availability of good-for-nothing .length and operator[] are
the issue. Putting in place the convention of using .raw is hardly
useful within the context.


D strings are arrays. An array without .length and operator[] is close
to being good for nothing. The language specification is quite clear
about the fact that e.g. char is not a character but an utf-8 code
unit.
Therefore char[] is an array of code units.

No, it isn't. That's the problem. char[] is not an array of char.
It has an additional invariant: it is a UTF8 string. If you randomly
change elements, the invariant is violated.

char[] is an array of char and the additional invariant is not enforced
by the language.

No, it isn't an ordinary array. For example with concatenation. char[] ~
int will never create an invalid string.

Yes it will.

void main() {
    char[] x;
    writeln(x~255);
}

You can end up with multiple chars being appended, even from a single append. 
foreach is different,
too. They are a bit magical.

Fair enough, but type conversion rules are a bit magical in general.

void main() {
    auto a = cast(short[])[1,2,3];
    auto b = [1,2,3];
    auto c = cast(short[])b;
    assert(a!=c);
}

There's quite a lot of code in the compiler to make sure that strings
remain valid.


At the same time, there are many language features that allow to create invalid strings.

auto a = "\377\252\314";
auto b = x"FF AA CC";
auto c = import("binary");

The additional invariant is not enforced in the case of slicing; that's
the point.

Reply via email to