https://gcc.gnu.org/bugzilla/show_bug.cgi?id=91735
--- Comment #10 from Richard Biener <rguenth at gcc dot gnu.org> --- Can't really decipher what clang does here. it seems to handle even/odd lanes separately, doing 24 vpextrb stores per loop iteration. Possibly simply an interleaving scheme...