subject:"\[Bug tree\-optimization\/91934\] Performance regression on 8.3.0 with \-O3 and avx"

[Bug tree-optimization/91934] Performance regression on 8.3.0 with -O3 and avx

2019-10-01 Thread rguenth at gcc dot gnu.org

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=91934 Richard Biener changed: What|Removed |Added Depends on||87105, 87746, 87800 --- Comment #8

[Bug tree-optimization/91934] Performance regression on 8.3.0 with -O3 and avx

2019-10-01 Thread rguenth at gcc dot gnu.org

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=91934 --- Comment #7 from Richard Biener --- So the difference between good and bad is data-ref access analysis which figures single-element interleaving in GCC 8 and nicer interleaving in GCC 9 where I rewrote parts of that analysis: t.c:15:9: note:

[Bug tree-optimization/91934] Performance regression on 8.3.0 with -O3 and avx

2019-09-30 Thread jakub at gcc dot gnu.org

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=91934 Jakub Jelinek changed: What|Removed |Added CC||jakub at gcc dot gnu.org --- Comment #6

[Bug tree-optimization/91934] Performance regression on 8.3.0 with -O3 and avx

2019-09-30 Thread rguenth at gcc dot gnu.org

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=91934 Richard Biener changed: What|Removed |Added Status|WAITING |NEW Known to work|

[Bug tree-optimization/91934] Performance regression on 8.3.0 with -O3 and avx

2019-09-30 Thread tochansky at tochlab dot net

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=91934 --- Comment #4 from Dmitrii Tochanskii --- I'm not a good specialist in avx, so I just see something like loop unroll or may be very log data preparation. For example: = vmovups ymm3, YMMWORD PTR [r8+r9] vmovups ymm5,