[Bug tree-optimization/101909] 73% regression on tfft benchmark for -O2 -ftree-loop-vectorize compared to -O2 on zen hardware
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=101909

--- Comment #5 from Hongtao.liu ---
Just note there is no regression for tfft benchmark comparing -O2 -ftree-loop-vectorize to -O2 on CLX.
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=101909

--- Comment #4 from rguenther at suse dot de ---
On Mon, 16 Aug 2021, marxin at gcc dot gnu.org wrote:

> https://gcc.gnu.org/bugzilla/show_bug.cgi?id=101909
>
> Martin Liška changed:
>
>   What |Removed |Added
>   CC   |        |marxin at gcc dot gnu.org
>
> --- Comment #3 from Martin Liška ---
> (In reply to Jan Hubicka from comment #2)
> > > So that's znver1 (split AVX IIRC) compared to znver2?
> > Martin will know how to decode machine names. I am never sure.
> > It is with generic, so split AVX does not make difference.
> >
> > Honza
>
> zenith:
> https://lnt.opensuse.org/db_default/v4/SPEC/machine/376
>
> benzen:
> https://lnt.opensuse.org/db_default/v4/SPEC/machine/393

So indeed zenith is znver1 and benzen znver2. But yeah, with generic -O2 we don't use AVX.
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=101909

Martin Liška changed:

  What |Removed |Added
  CC   |        |marxin at gcc dot gnu.org

--- Comment #3 from Martin Liška ---
(In reply to Jan Hubicka from comment #2)
> > So that's znver1 (split AVX IIRC) compared to znver2?
> Martin will know how to decode machine names. I am never sure.
> It is with generic, so split AVX does not make difference.
>
> Honza

zenith:
https://lnt.opensuse.org/db_default/v4/SPEC/machine/376

benzen:
https://lnt.opensuse.org/db_default/v4/SPEC/machine/393
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=101909

--- Comment #2 from Jan Hubicka ---
> So that's znver1 (split AVX IIRC) compared to znver2?
Martin will know how to decode machine names. I am never sure.
It is with generic, so split AVX does not make difference.

Honza
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=101909

Richard Biener changed:

  What      |Removed    |Added
  Component |middle-end |tree-optimization
  Keywords  |           |missed-optimization
  Blocks    |           |53947
  Target    |           |x86_64-*-*
  CC        |           |rguenth at gcc dot gnu.org

--- Comment #1 from Richard Biener ---
So that's znver1 (split AVX IIRC) compared to znver2?

Referenced Bugs:

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=53947
[Bug 53947] [meta-bug] vectorizer missed-optimizations