https://gcc.gnu.org/bugzilla/show_bug.cgi?id=87599
--- Comment #2 from vgatherps at gmail dot com ---
Thanks! That fixes the optimization. However, using something like -march=haswell or -march=corei7 does not result in this optimization being made, even though, as far as I know, -march=<intel-cpu> should imply -mtune=intel.