[Bug rtl-optimization/69710] performance issue with SP Linpack with Autovectorization

2021-05-04 Thread rguenth at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=69710 Richard Biener changed: What|Removed |Added Status|NEW |ASSIGNED

[Bug rtl-optimization/69710] performance issue with SP Linpack with Autovectorization

2016-03-19 Thread ramana at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=69710 Ramana Radhakrishnan changed: What|Removed |Added Status|UNCONFIRMED |NEW Last reconfirmed|

[Bug rtl-optimization/69710] performance issue with SP Linpack with Autovectorization

2016-03-07 Thread doug.gilmore at imgtec dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=69710 --- Comment #15 from Doug Gilmore --- > I had a patch too, will send it for review in GCC7 if it's still needed. Sorry I got side track last week and didn't make much progress. Please go ahead and submit if you have something you feel

[Bug rtl-optimization/69710] performance issue with SP Linpack with Autovectorization

2016-03-07 Thread amker at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=69710 --- Comment #14 from amker at gcc dot gnu.org --- (In reply to Doug Gilmore from comment #13) > I think this should be fairly straightforward to fix in the > autovectorization pass. Hopefully I should be able to post a patch > in the next few

[Bug rtl-optimization/69710] performance issue with SP Linpack with Autovectorization

2016-02-24 Thread doug.gilmore at imgtec dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=69710 --- Comment #13 from Doug Gilmore --- I think this should be fairly straightforward to fix in the autovectorization pass. Hopefully I should be able to post a patch in the next few days.

[Bug rtl-optimization/69710] performance issue with SP Linpack with Autovectorization

2016-02-16 Thread doug.gilmore at imgtec dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=69710 --- Comment #12 from Doug Gilmore --- > Yes, I proposed some cleanup passess after vectorization but richi > thinks it's genrally expensive. So what's implmentation complexity > of pass_dominator? One thing we might consider is only enable it

[Bug rtl-optimization/69710] performance issue with SP Linpack with Autovectorization

2016-02-14 Thread amker at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=69710 --- Comment #11 from amker at gcc dot gnu.org --- (In reply to Doug Gilmore from comment #10) > Created attachment 37681 [details] > prototype fix > > > 1) we failed recognize that use 0 and 2 are identical to each other. > > This is because

[Bug rtl-optimization/69710] performance issue with SP Linpack with Autovectorization

2016-02-13 Thread doug.gilmore at imgtec dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=69710 --- Comment #10 from Doug Gilmore --- Created attachment 37681 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=37681=edit prototype fix > 1) we failed recognize that use 0 and 2 are identical to each other. > This is because vectorizer

[Bug rtl-optimization/69710] performance issue with SP Linpack with Autovectorization

2016-02-10 Thread amker at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=69710 --- Comment #8 from amker at gcc dot gnu.org --- Reproduced on arm with saxpy.c. The dump for slp is as below: : _82 = prologue_after_cost_adjust.7_43 * 4; vectp_dy.13_81 = dy_9(D) + _82; _87 = prologue_after_cost_adjust.7_43 * 4;

[Bug rtl-optimization/69710] performance issue with SP Linpack with Autovectorization

2016-02-10 Thread amker at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=69710 --- Comment #9 from amker at gcc dot gnu.org --- Also the problem exists only for arm because it doesn't support [base+index] addressing mode for vect load/store. I guess mips doesn't either.

[Bug rtl-optimization/69710] performance issue with SP Linpack with Autovectorization

2016-02-09 Thread amker at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=69710 amker at gcc dot gnu.org changed: What|Removed |Added CC||amker at gcc dot gnu.org ---

[Bug rtl-optimization/69710] performance issue with SP Linpack with Autovectorization

2016-02-09 Thread amker at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=69710 --- Comment #7 from amker at gcc dot gnu.org --- Hmm, the first problem is the two iv uses from dy load/store are not recognized as having same base address/object. This may caused by my patch disabling expansion of iv base. Or it exists all

[Bug rtl-optimization/69710] performance issue with SP Linpack with Autovectorization

2016-02-06 Thread pinskia at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=69710 Andrew Pinski changed: What|Removed |Added Keywords||missed-optimization

[Bug rtl-optimization/69710] performance issue with SP Linpack with Autovectorization

2016-02-06 Thread doug.gilmore at imgtec dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=69710 --- Comment #5 from Doug Gilmore --- Thanks for checking on AArch64 Andrew. BTW, I made my (incorrect) hunch by running a test on gcc113, where the installed 4.8 compile showed problems for both DP and SP. (I assumed that the problem was