https://gcc.gnu.org/bugzilla/show_bug.cgi?id=109896
--- Comment #4 from Andrew Pinski <pinskia at gcc dot gnu.org> --- (In reply to Thiago Macieira from comment #3) > 5 instructions, 4 cycles (not including front-end decode), so roughly the > same as the imulq example above (4 cycles), but with far more ports to > dispatch to. If you are that picky for cycles, these cycles are not going to be a problem compared to the dynamic allocation that is just about to happen ......