Re: [phobos] UTF-8 string slicing

Walter Bright Sat, 20 Aug 2011 08:00:37 -0700


unDEFER wrote:

On Sat, 20 Aug 2011 06:49:33 +0400, Walter Bright<[email protected]> wrote:
There isn't any getting away from understanding that UTF-8 is amulti-byte encoding.
If it is so, then arr.popFront() must break UTF-8 strings ;-)
If you want to use an encoding with a 1:1 correspondence betweenindices and characters, use dchar encoding.
For me use in 4 times more memory for ASCII seems too wasteful, sorry.

Exactly - all I'm saying is that if you want the benefits of UTF-8 - lowmemory consumption *and* high speed processing, you have to be cognizantof its underlying storage scheme. In order to get a higher level of "Idon't care how it is stored, I just want to pretend it's an array ofUnicode characters", you'll have to give up one or more of efficiencyand memory consumption.

Walter, I really very like your creation. It is great. Big thank youfor it!
I really believe that there is no bugs, only not documented features ;-)
I just want to say that the documentation now give enough information.
std.range or std.array documentation don't say anything about it'sbehaviour on UTF-8 strings.I'm already see source codes to know what really does any function.Open Source is really great :-)


I agree, open source can make up for gaps in the documentation.
_______________________________________________
phobos mailing list
[email protected]
http://lists.puremagic.com/mailman/listinfo/phobos

Re: [phobos] UTF-8 string slicing

Reply via email to