https://gcc.gnu.org/bugzilla/show_bug.cgi?id=84327
--- Comment #5 from xyzdr4gon333 at googlemail dot com ---
Too bad. Before I take a longer look at the assembler code: any quick thoughts on which optimization, not available as a single option, could account for the 4x speedup?