Re: [fpc-devel] Question about memory alignment (again!)

Martin Frb via fpc-devel Wed, 17 Aug 2022 02:03:16 -0700

On 17/08/2022 02:21, J. Gareth Moreton via fpc-devel wrote:

Hi everyone,
Recently I've made some optimisations centred around the SHRinstruction on x86, and there was one pair of instructions that caughtmy attention:
movl (%rbx),%eax
shrl $24,%eax

Is it permissible to optimise this to (x86 is little-endian):

movzbl 3(%rbx),%eax?
(You could also optimise "movl; sarl" into a "movsbl" instruction thisway)
Logically the result is the same and it removes an instruction and apipeline stall, but will there be a performance hit that comes fromreading an unaligned byte of memory like that?


Doesn't shr set the carry flag to the former bit 23? (the last shifted out)

So its not the same, unless there is no dependency on the carry flaglater on.

I did make similar optimisation once before with QWords using theimplicit zero-extension of the 32-bit MOV instruction - that is:
movq (%rbx),%rax
shrq $32,%rax

To:

movl 4(%rbx),%eax
This one is a little nicer though because it's still on a 32-bitboundary and so was permissible.


Same issue?
_______________________________________________
fpc-devel maillist  -  fpc-devel@lists.freepascal.org
https://lists.freepascal.org/cgi-bin/mailman/listinfo/fpc-devel

Re: [fpc-devel] Question about memory alignment (again!)

Reply via email to