On Friday, 14 March 2014 at 11:04:34 UTC, Manu wrote:
On 14 March 2014 18:03, John Colvin <[email protected]> wrote:
As much as I like the idea:

Something always tells me this is the compiler's job... What clever reasoning are you applying that the compiler's inliner can't? It seems like a different situation to, say, SIMD code, where correctly structuring loops can require a lot of gymnastics that the compiler can't or won't (floating point conformance) do. The inlining decision seems easily automatable in comparison.

I understand that unoptimised builds for debugging are a problem, but a sensible compiler lets you hand-pick your optimisation passes.

In short: why are compilers not good enough at this that the programmer needs to be involved?
The compiler applies generalised heuristics, which are certainly tuned for the 'common' case, whatever that happens to be. The compiler simply doesn't know what you're doing, so it's very hard for it to do anything really intelligent. Inlining heuristics are fickle, and they also don't know what you're actually trying to do.

Is a function 'long'? How long is 'long'? Is the function 'hot'? Do we prefer code size or execution speed? Is the function called only from this location, or is it used in many locations? Etc.

Inlining is one of the fuzziest pieces of logic in the compiler, and it relies on a lot of information that is impossible for the compiler to deduce, so it applies heuristics to try and do a decent job, but it's certainly not perfect.

I argue that nothing so fickle can exist in the language without a manual override. Especially not in a native language.
In my current case, the functions I need to inline are not exactly trivial. They're really pushing the boundaries of the compiler's inliner heuristics, and then I'm calling a series of such functions that operate on parallel data.

If they don't inline, the performance equals the sum of the functions plus some overhead. If they all inline, the performance equals only that of the longest one, with no overhead (the others fill in pipeline gaps).
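To make the shape of that concrete, here is a minimal hypothetical sketch (illustrative functions, not the actual code in question):

// Several non-trivial, mutually independent transforms over the
// same stream of data.
float fA(float x) { return x * 1.5f + 2.0f; }
float fB(float x) { return x * x - 0.5f; }
float fC(float x) { return (x + 1.0f) * 0.25f; }

void process(const float[] xs, float[] a, float[] b, float[] c)
{
    foreach (i; 0 .. xs.length)
    {
        // The three results are independent of each other. If fA, fB
        // and fC all inline, the CPU can overlap their instructions,
        // so the loop body costs roughly as much as the longest of
        // the three, rather than the sum of all three plus three
        // calls' worth of overhead.
        a[i] = fA(xs[i]);
        b[i] = fB(xs[i]);
        c[i] = fC(xs[i]);
    }
}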
Further, some of these functions embed some shared work... if they don't inline, this work is repeated. If they do inline, the redundant repeated work is eliminated.
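For instance (again a hypothetical sketch): two functions that each recompute the same sub-expression internally; once both are inlined at the call site, common subexpression elimination can do that work once.

float fA(float x, float scale)
{
    float n = 1.0f / scale;  // shared work
    return x * n + 1.0f;
}

float fB(float x, float scale)
{
    float n = 1.0f / scale;  // the same work, repeated
    return x * n - 1.0f;
}

float combine(float x, float scale)
{
    // Out of line, 1.0f / scale is computed twice. With fA and fB
    // inlined here, the optimiser computes it once and reuses it.
    return fA(x, scale) + fB(x, scale);
}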
My experiments with std.algorithm were a failure. I realised quickly that I couldn't rely on the inliner to do a satisfactory job, and the optimiser was unable to do its job properly.
std.algorithm could really benefit from the mixin suggestion, since things like predicate functions are always trivial, usually supplied as little lambdas, and inlining isn't reliable. Especially in the debug builds. Something like algorithm loop sugar shouldn't run heaps worse than an explicit loop just because it happens to be implemented by a generic function.
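The kind of call in question looks something like this (a sketch; how much slower the range version actually runs depends on the compiler and flags):

import std.algorithm : filter, map;
import std.array : array;

void main()
{
    auto data = [1, 2, 3, 4, 5];

    // The lambdas are trivial, but each element passes through
    // template-instantiated range code; unless the inliner fires
    // (unreliable, especially in debug builds), this pays call
    // overhead per element...
    auto viaRanges = data.filter!(x => x % 2 != 0)
                         .map!(x => x * x)
                         .array;

    // ...where the equivalent explicit loop pays none.
    int[] viaLoop;
    foreach (x; data)
        if (x % 2 != 0)
            viaLoop ~= x * x;

    assert(viaRanges == viaLoop);
}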
Thanks for the explanations.
Another use case is to aid propagation of compile-time information for optimisation. A function might look like a poor candidate for inlining for other reasons, but if there's a statically known (to the caller) integer parameter coming in that will be used to decide a loop length, inlining allows that info to be propagated to the callee. Static loop lengths => well optimised loops, with opportunities for optimal unrolling. Even with quite a large function this can be a good choice to inline.
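Roughly this shape (a hypothetical sketch):

// Looks like a poor inlining candidate: non-trivial body, runtime
// loop bound.
float dot(const float[] a, const float[] b, size_t n)
{
    float sum = 0;
    foreach (i; 0 .. n)
        sum += a[i] * b[i];
    return sum;
}

float caller(const float[] a, const float[] b)
{
    // n is statically known here. If dot is inlined, the constant 4
    // propagates into the loop bound, and the optimiser can fully
    // unroll (and potentially vectorise) the loop.
    return dot(a, b, 4);
}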
I don't know how good compilers are at taking this sort of thing into account already.