https://gcc.gnu.org/bugzilla/show_bug.cgi?id=85694
--- Comment #2 from Richard Biener <rguenth at gcc dot gnu.org> --- Hmm, but if you have 255 + 255 + 1 then you need to use pavgw at least, otherwise the vectorization isn't semantically equivalent? Or do the instructions compute the intermediate results in greater precision than 8 bits? The specification doesn't seem to tell. Can you clarify?