https://gcc.gnu.org/bugzilla/show_bug.cgi?id=54803
vekumar at gcc dot gnu.org changed: What |Removed |Added ---------------------------------------------------------------------------- CC| |vekumar at gcc dot gnu.org --- Comment #5 from vekumar at gcc dot gnu.org --- On bdver4 when we enable -march=bdver4 and -mno-prefer-avx128 vectorizes using YMM Otherwise uses vprotq instruction. .L13: vmovdqa (%r8,%r9), %ymm0 incq %rax vpsrlq $32, %ymm0, %ymm1 vpsllq $32, %ymm0, %ymm0 vpor %ymm0, %ymm1, %ymm0 vmovdqa %ymm0, (%rdx,%r9) addq $32, %r9 cmpq %rax, %r10 ja .L13