Re: kill the commas! (phobos code cleanup)

via Digitalmars-d Sun, 07 Sep 2014 03:51:02 -0700

On Sunday, 7 September 2014 at 10:29:41 UTC, ketmar viaDigitalmars-d wrote:

index nth symbol! ucs-4 (aka dchar/dstring) is ok though.

For western text strings utf-8 is much better due to cacheefficiency. You can speed it up using SSE or dedicateddatastructures.

The point of having unique immutable strings is that they compareby reference only and that you can have auxillary datastructuresthat classify them if needed.

I think the D approach to strings is unpleasant. You should nothave slices of strings, only slices of ubyte arrays.

If you want real speedups for streams of symbols you have to moveinto the landscape of huffman-encoding, tries, dedicateddatastructures…

Having uniform string support in libraries (i.e. only supportingutf-8) is a clear advantage IMO, that will allow for APIs thatare SSE backed and performant.

Re: kill the commas! (phobos code cleanup)

Reply via email to