Re: [fpc-devel] Tail recursion optimization

Jonas Maebe Tue, 10 Oct 2006 05:12:29 -0700


On 10 Oct 2006, at 13:46, Florian Klaempfl wrote:

Practical argument:
the assembler code _is_ better for the code I tested and up to twice as fast
as the original one.

That's indeed true for an extremely small function with so few local variables/parameters that even on an i386 it doesn't need spilling in the absence of register allocation optimizations. That's why I said it *may* currently degrade performance in more complex functions (it also may not, I really don't know, it was just a remark).

Theoretical argument:
- using the tail goto you've one function with one set of variables being active
across the whole function
- using a recursive call you've at least two sets of variables: the caller and the callee ones. Though one set is only active at a limited part of the function, the set of the caller is still in use while the callee is called
though they (the caller variables) are spilled.

The first point is just as much a downside as an upside in the current situation, because parameters/variables which are used without being destroyed by some function call can normally be put in a (reusable) volatile register. Now they need a non-volatile register during the entire function.

The fact that you get such a speedup is in my view mainly an indication of the fact that the function barely does anything, and that almost half the time is spent in setting up and tearing down stack frames. So it's logical that register allocation has little influence and that optimizations which remove this stack frame logic help a lot.
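
For illustration only, here is a minimal sketch of the transformation under discussion, written in C rather than Pascal. The function names sum_rec and sum_goto are hypothetical and are not the code that was actually tested in this thread. The first version makes a real recursive call and so sets up one stack frame per step; the second is roughly what the "tail goto" form amounts to: the parameters are overwritten in place and control jumps back to the start, so a single set of variables stays live for the whole function.

```c
/* Hypothetical example: add the numbers 1..n to an accumulator.
   The recursive call is the very last action, so it is a tail call. */
unsigned long sum_rec(unsigned long n, unsigned long acc)
{
    if (n == 0)
        return acc;
    /* Nothing remains to be done after this call returns. */
    return sum_rec(n - 1, acc + n);
}

/* The same logic with the tail call replaced by a jump to the top.
   No new stack frame is created; n and acc are reused on each pass. */
unsigned long sum_goto(unsigned long n, unsigned long acc)
{
top:
    if (n == 0)
        return acc;
    acc = acc + n;   /* new value of the second argument */
    n = n - 1;       /* new value of the first argument */
    goto top;
}
```

In a function this small almost all of the work in sum_rec is call overhead, which fits the speedup reported above. Whether the second form also wins in a larger function with more live variables depends on the register allocator, as noted.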

I guess it could be used to change the code above to
leal 1(%esi),%eax
?


Yes.

Or is this slower on some CPUs?


I don't know.


Jonas
_______________________________________________
fpc-devel maillist  -  fpc-devel@lists.freepascal.org
http://lists.freepascal.org/mailman/listinfo/fpc-devel
