https://gcc.gnu.org/bugzilla/show_bug.cgi?id=59371
Jiu Fu Guo <guojiufu at gcc dot gnu.org> changed: What |Removed |Added ---------------------------------------------------------------------------- CC| |guojiufu at gcc dot gnu.org --- Comment #27 from Jiu Fu Guo <guojiufu at gcc dot gnu.org> --- For -O2, since a few optimizations are not enabled (e.g. some loop-based optimizations), the code was not optimized too much. At -O3, now, GCC could vectorize it. While with GCC 4.8, the code was not vectorized. I guess the pain in performance may be mitigated.