https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120233
--- Comment #6 from Jakub Jelinek <jakub at gcc dot gnu.org> --- Although, for foo1 the optimization has been relying on SLP vectorization, without that we've been emitting two separate bswap32 calls.
