On Sat, 6 Apr 2024 at 04:38, Nathan Bossart <nathandboss...@gmail.com> wrote: > This seems to provide a small performance boost, so I've incorporated it > into v27.
Won't Valgrind complain about this? +pg_popcount_avx512(const char *buf, int bytes) + buf = (const char *) TYPEALIGN_DOWN(sizeof(__m512i), buf); + val = _mm512_maskz_loadu_epi8(mask, (const __m512i *) buf); David