https://gcc.gnu.org/bugzilla/show_bug.cgi?id=63503
--- Comment #5 from Andrew Pinski <pinskia at gcc dot gnu.org> --- Also how sure are you that it is the fused multiply-add and not the scheduling of the instructions? As I mentioned, try swapping the cmp and fmadd; you might get a performance boost.