https://gcc.gnu.org/bugzilla/show_bug.cgi?id=101639
--- Comment #10 from Hongtao Liu <liuhongt at gcc dot gnu.org> ---
clang generates
avx512:
f(int*, long):
vmovdqu xmm0, xmmword ptr [rdi]
vptestnmd k0, xmm0, xmm0
kortestb k0, k0
sete al
ret
avx2:
f(int*, long):
vpxor xmm0, xmm0, xmm0
vpcmpeqd xmm0, xmm0, xmmword ptr [rdi]
vmovmskps eax, xmm0
test eax, eax
sete al
ret
Maybe GCC can reuse cstorem4 similar as cbranchm4 for those mask.
