Re: "foreach(i, dchar c; s)" vs "decode"

monarch_dodra Sun, 25 Nov 2012 23:45:26 -0800

On Sunday, 25 November 2012 at 21:51:42 UTC, Jonathan M Daviswrote:

On Sunday, November 25, 2012 22:37:24 monarch_dodra wrote:
I got these results on 2.061 alpha release, with phobos in
release and both -inline and without inline.
You should also be testing with -O if you're benchmarking, butI still wouldhave thought that the compiler would be faster. Apparently not.I believe thatdefinite work has been put into improving the decode, stride,popFront, etc. inPhobos over the past year or two, so they've definitely beenimproving. Isuspect that whatever the compiler is doing hasn't been touchedin ages, and Ihave no idea what improvements could or couldn't be done. It_is_ the sort ofthing that I'd kind of expect to be sitting somewhere indruntime though. Ifit is, maybe foreach and Phobos' implemenations can be made toshare in someway. I don't know (though IMHO speed should be more importanthere than
reducing code duplication).
The speed of foreach's decoding definitely matters, but in thecode that I'vereally been trying to make fast, I don't generally use it,because it's oftenthe case that some portion of what I'm doing can be made fasterby skippingdecoding for some portion of the characters (like explicitlyhandling the codeunits for paraSep and lineSep in code that cares about the endof lines).Making string processing fast should definitely be one of ourperformancepriorities though IMHO given how big an impact that can have onmany programsand how unfriendly ranges generally are to efficient stringprocessing.
- Jonathan M Davis

Well, "-release -O" went without saying, but you are right tomention it, you never know.

Looking at 2.060 to 2.061, std.utf has changed a lot. I'll benchmy algo using the old implementation of 2.060 to see if thechange of performance could be related to that.

As you said, I found how some a "rt.util.utf" module in druntime,I was looking in the dmd tree. However, it is pretty much an oldversion of std.utf, verbatim...

Also, druntime has a *radically* different approach to stridingUTF-8. I'll try to see which approach is faster.

I'd have suggested we try some sort of code sharing, but now that"std.utf" supports range, the code has "forked" and I'm not sureis shareable anymore... Not without duplicating code insidestd.utf, or adding range support (or at least code) for decodingranges in druntime.

Well, I'll see what I can uncover, and update dmd utf in themeantime...

Re: "foreach(i, dchar c; s)" vs "decode"

Reply via email to