On Fri, 28 Oct 2022 20:19:35 GMT, vpaprotsk <d...@openjdk.org> wrote:
> And just looking now on uops.info, they seem to have identical timings? Actual instruction being used (aligned vs unaligned versions) doesn't matter much here, because it's a dynamic property of the address being accessed: misaligned accesses that cross cache line boundary incur a penalty. Since the cache line size is 64 byte in size, every misaligned 512-bit access is penalized. ------------- PR: https://git.openjdk.org/jdk/pull/10582