On Thu, 2022-09-01 at 16:46 +0000, Christophe Leroy wrote:
> Surprisingly, I get worst performance with inline static call than
> with out of line static call:

I'm not sure what hackbench is doing, but when microbenchmarking 64-bit out-of-line calls in a loop I saw something similar: adding more indirection improved performance despite doing more work. The cause seemed to be a combination of older hardware and a target that was too short (just an integer increment). After moving to a newer machine and padding the target with a lot of NOPs, the numbers made sense.
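
For illustration, here is a minimal user-space sketch of that kind of microbenchmark. It is not the test that was actually run, and it does not use the kernel static_call machinery. All names (short_target, padded_target, trampoline, time_calls, report) and the NOP count of 64 are hypothetical, and it assumes GCC or Clang with a GNU-style assembler for the noinline attribute and the .rept directive.

```c
#include <stdint.h>
#include <stdio.h>
#include <time.h>

/* Written by the targets; volatile so the increment cannot be optimised away. */
volatile uint64_t counter;

/* Short target: just an integer increment. */
__attribute__((noinline)) void short_target(void)
{
	counter++;
}

/*
 * Same work, padded with NOPs so the body is no longer trivially short.
 * The count of 64 is an arbitrary choice for illustration.
 */
__attribute__((noinline)) void padded_target(void)
{
	counter++;
	__asm__ volatile(".rept 64\n\tnop\n\t.endr");
}

/* One extra level of indirection in front of the target. */
__attribute__((noinline)) void trampoline(void (*fn)(void))
{
	fn();
}

/*
 * Call fn iters times, either directly through the pointer or through
 * the trampoline, and return the elapsed CPU time in seconds.
 */
double time_calls(void (*fn)(void), uint64_t iters, int indirect)
{
	clock_t start = clock();

	if (indirect) {
		for (uint64_t i = 0; i < iters; i++)
			trampoline(fn);
	} else {
		for (uint64_t i = 0; i < iters; i++)
			fn();
	}

	return (double)(clock() - start) / CLOCKS_PER_SEC;
}

/* Print both timings for one target. */
void report(const char *name, void (*fn)(void), uint64_t iters)
{
	double direct = time_calls(fn, iters, 0);
	double indirect = time_calls(fn, iters, 1);

	printf("%-8s direct %.3fs, via trampoline %.3fs\n",
	       name, direct, indirect);
}
```

Calling report("short", short_target, n) and report("padded", padded_target, n) from main() with a large n gives the two comparisons. Whether the extra indirection shows up as slower or faster will depend on the machine and the compiler flags, which is the point of trying both targets.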