Re: Strange counter-performance in an alternative `decimalLength9` function

Bruce Carneal via Digitalmars-d-learn Fri, 28 Feb 2020 02:36:23 -0800

On Friday, 28 February 2020 at 10:11:23 UTC, Bruce Carneal wrote:

On Friday, 28 February 2020 at 06:50:55 UTC, 9il wrote:
On Wednesday, 26 February 2020 at 00:50:35 UTC, Basile B.wrote:
So after reading the translation of RYU I was interested toosee if the decimalLength() function can be written to befaster, as it cascades up to 8 CMP.
[...]
bsr can be done in one/two CPU operation, quite quick. Butcore.bitop.bsr wouldn't be inlined. Instead, mir-core(mir.bitop: ctlz) or LDC intrinsics llvm_ctlz can be used forto get code with inlining.
That's surprising. I just got ldc to inline core.bitop.bsr onrun.dlang.io using ldc -O3 -mcpu=native. (not sure what thetarget CPU is)
Under what conditions should I be guarding against an inliningfailure?


Here's the code I used:

int main(string[] args)
{
    import core.bitop : bsr;
    return bsr(cast(uint)args.length);
}

BTW, I'm a huge fan of your performance work.

Re: Strange counter-performance in an alternative `decimalLength9` function

Reply via email to