[Bug tree-optimization/112325] Missed vectorization of reduction after unrolling

2024-05-29 Thread sjames at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=112325 Sam James changed: What|Removed |Added Target Milestone|--- |15.0

[Bug tree-optimization/112325] Missed vectorization of reduction after unrolling

2024-05-29 Thread liuhongt at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=112325 Hongtao Liu changed: What|Removed |Added Status|NEW |RESOLVED Resolution|---

[Bug tree-optimization/112325] Missed vectorization of reduction after unrolling

2024-05-29 Thread cvs-commit at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=112325 --- Comment #17 from GCC Commits --- The master branch has been updated by hongtao Liu : https://gcc.gnu.org/g:ef27b91b62c3aa8841c02665dffa8914c742fd37 commit r15-919-gef27b91b62c3aa8841c02665dffa8914c742fd37 Author: liuhongt Date: Tue Feb

[Bug tree-optimization/112325] Missed vectorization of reduction after unrolling

2024-02-28 Thread liuhongt at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=112325 --- Comment #16 from Hongtao Liu --- > I'm all for removing the 1/3 for innermost loop handling (in cunroll > the unrolled loop is then innermost). I'm more concerned about > unrolling more than one level which is exactly what's required for

[Bug tree-optimization/112325] Missed vectorization of reduction after unrolling

2024-02-28 Thread rguenther at suse dot de via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=112325 --- Comment #15 from rguenther at suse dot de --- On Wed, 28 Feb 2024, liuhongt at gcc dot gnu.org wrote: > https://gcc.gnu.org/bugzilla/show_bug.cgi?id=112325 > > --- Comment #14 from Hongtao Liu --- > (In reply to rguent...@suse.de from

[Bug tree-optimization/112325] Missed vectorization of reduction after unrolling

2024-02-27 Thread liuhongt at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=112325 --- Comment #14 from Hongtao Liu --- (In reply to rguent...@suse.de from comment #13) > On Tue, 27 Feb 2024, liuhongt at gcc dot gnu.org wrote: > > > https://gcc.gnu.org/bugzilla/show_bug.cgi?id=112325 > > > > --- Comment #11 from Hongtao Liu

[Bug tree-optimization/112325] Missed vectorization of reduction after unrolling

2024-02-26 Thread rguenther at suse dot de via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=112325 --- Comment #13 from rguenther at suse dot de --- On Tue, 27 Feb 2024, liuhongt at gcc dot gnu.org wrote: > https://gcc.gnu.org/bugzilla/show_bug.cgi?id=112325 > > --- Comment #11 from Hongtao Liu --- > > >Loop body is likely going to

[Bug tree-optimization/112325] Missed vectorization of reduction after unrolling

2024-02-26 Thread rguenther at suse dot de via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=112325 --- Comment #12 from rguenther at suse dot de --- On Tue, 27 Feb 2024, liuhongt at gcc dot gnu.org wrote: > https://gcc.gnu.org/bugzilla/show_bug.cgi?id=112325 > > --- Comment #10 from Hongtao Liu --- > (In reply to Hongtao Liu from comment

[Bug tree-optimization/112325] Missed vectorization of reduction after unrolling

2024-02-26 Thread liuhongt at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=112325 --- Comment #11 from Hongtao Liu --- >Loop body is likely going to simplify further, this is difficult >to guess, we just decrease the result by 1/3. */ > This is introduced by r0-68074-g91a01f21abfe19 /* Estimate number of insns

[Bug tree-optimization/112325] Missed vectorization of reduction after unrolling

2024-02-26 Thread liuhongt at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=112325 --- Comment #10 from Hongtao Liu --- (In reply to Hongtao Liu from comment #9) > The original case is a little different from the one in PR. But the issue is similar, after cunrolli, GCC failed to vectorize the outer loop. The interesting

[Bug tree-optimization/112325] Missed vectorization of reduction after unrolling

2024-02-26 Thread liuhongt at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=112325 --- Comment #9 from Hongtao Liu --- The original case is a little different from the one in PR. It comes from ggml #include #include typedef uint16_t ggml_fp16_t; static float table_f32_f16[1 << 16]; inline static float

[Bug tree-optimization/112325] Missed vectorization of reduction after unrolling

2023-11-20 Thread cvs-commit at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=112325 --- Comment #8 from CVS Commits --- The master branch has been updated by hongtao Liu : https://gcc.gnu.org/g:e5e305e6048c042139037378fe6abfad5735b54f commit r14-5632-ge5e305e6048c042139037378fe6abfad5735b54f Author: liuhongt Date: Fri Nov

[Bug tree-optimization/112325] Missed vectorization of reduction after unrolling

2023-11-19 Thread cvs-commit at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=112325 --- Comment #7 from CVS Commits --- The master branch has been updated by hongtao Liu : https://gcc.gnu.org/g:2b59e2b4dff42118fe3a505f07b9a6aa4cf53bdf commit r14-5603-g2b59e2b4dff42118fe3a505f07b9a6aa4cf53bdf Author: liuhongt Date: Thu Nov

[Bug tree-optimization/112325] Missed vectorization of reduction after unrolling

2023-11-16 Thread pinskia at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=112325 Andrew Pinski changed: What|Removed |Added Depends on||106343 --- Comment #6 from Andrew

[Bug tree-optimization/112325] Missed vectorization of reduction after unrolling

2023-11-16 Thread pinskia at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=112325 Bug 112325 depends on bug 112579, which changed state. Bug 112579 Summary: bb vectorizer failed to reduction sum += inv >> {0,1,2,3,4,5,6,7,8,9,10,11,12,13,14,15} https://gcc.gnu.org/bugzilla/show_bug.cgi?id=112579 What|Removed

[Bug tree-optimization/112325] Missed vectorization of reduction after unrolling

2023-11-16 Thread rguenth at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=112325 --- Comment #5 from Richard Biener --- Yes, as I said in comment#2. Note I specifically ended up not open-coding the reduction because of concerns of efficiency. So a target should only provide reduc_*_scal patterns when they are more

[Bug tree-optimization/112325] Missed vectorization of reduction after unrolling

2023-11-16 Thread liuhongt at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=112325 --- Comment #4 from liuhongt at gcc dot gnu.org --- (In reply to liuhongt from comment #3) > BB vectorizer relies on the backend support of .REDUC_PLUS for reduction, > but loop vectorizer can manually do reduction. That's why it's not >

[Bug tree-optimization/112325] Missed vectorization of reduction after unrolling

2023-11-16 Thread liuhongt at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=112325 liuhongt at gcc dot gnu.org changed: What|Removed |Added CC||liuhongt at gcc dot

[Bug tree-optimization/112325] Missed vectorization of reduction after unrolling

2023-11-02 Thread rguenth at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=112325 Richard Biener changed: What|Removed |Added CC||rguenth at gcc dot gnu.org