On 9/21/11 10:16 AM, Christophe wrote:
Timon Gehr , dans le message (digitalmars.D:144889), a écrit :
unicode natively. Yet the 'D strings are strange and confusing' argument
comes up quite often on the web.

Well, I think they are. The ptr+length stuff is amasing, but the
behavior of strings in phobos is weird.

mini-quiz: what should std.range.drop(some_string, 1) do ?
hint: what it actually does is not what the documentation of phobos
suggests*...

Strings are array of char, but they appear like a lazy range of dchar to
phobos. I could cope with the fact that this is a little unexpected for
beginners. But well, that creates a lot of exceptions in phobos, like
the fact that you can't even copy a char[] to a char[] with
std.algorithm.copy. And I don't mention all the optimization that are
not/cannot be performed for those strings. I'll just remember to use
ubyte[] wherever I can...

String handling in D is good modulo the oddities you noticed. What would make it perfect would be:

* Add property .rep that returns byte[], ushort[], or uint[] for char[], wchar[], dchar[] respectively (with the appropriate qualifier).

* Replace .length with .codeUnits.

* Disallow [n] and [m .. n]

This would upgrade D's strings from good to awesome. Really it would be a dream come true. Unfortunately it would also break most D code there is out there. I don't see how we can improve the current situation while staying backward compatible.


Andrei

Reply via email to