https://gcc.gnu.org/bugzilla/show_bug.cgi?id=121213
--- Comment #1 from Vincenzo Romano <vincenzo.romano at gmail dot com> ---
For point no. 1 above: using a5 requires reloading it with 0 on every loop iteration. Using the zero register instead saves that one instruction.