[Bug rtl-optimization/108086] [11/12/13 Regression] internal compiler error: in set_accesses, at rtl-ssa/internals.inl:449

2022-12-16 Thread rsandifo at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=108086 rsandifo at gcc dot gnu.org changed: What|Removed |Added Assignee|unassigned at gcc dot gnu.org |rsandifo at gcc

[Bug middle-end/107991] [10/11/12/13 Regression] Extra mov instructions with ternary on x86

2022-12-21 Thread rsandifo at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=107991 --- Comment #5 from rsandifo at gcc dot gnu.org --- >From a slightly old build, but it looks like we have a redundant move: (insn 4 27 28 2 (set (reg/v:SI 85 [ i ]) (reg:SI 91)) "foo.c":9:31 83 {*movsi_internal}

[Bug target/107812] [11/12/13 Regression] RTL SSA forwprop introduced regression since r11-6188

2022-11-22 Thread rsandifo at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=107812 rsandifo at gcc dot gnu.org changed: What|Removed |Added Last reconfirmed||2022-11-22 Ever

[Bug libgomp/96844] OpenMP: two worksharing constructs with different num_threads clauses break thread pooling

2023-01-16 Thread rsandifo at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=96844 rsandifo at gcc dot gnu.org changed: What|Removed |Added CC||rsandifo at gcc dot

[Bug tree-optimization/108571] New: Fix for PR96373 regresses fabd_1.c with -ftrapping-math

2023-01-27 Thread rsandifo at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=108571 Bug ID: 108571 Summary: Fix for PR96373 regresses fabd_1.c with -ftrapping-math Product: gcc Version: 13.0 Status: UNCONFIRMED Keywords: missed-optimization

[Bug target/108583] [13 Regression] wrong code with vector division by uint16 at -O2

2023-01-30 Thread rsandifo at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=108583 --- Comment #6 from rsandifo at gcc dot gnu.org --- (In reply to Tamar Christina from comment #3) > The vectorizer has this context but since we didn't want a new IFN the > context should instead be derivable in >

[Bug tree-optimization/108608] [12/13 Regression] AArch64 vectorizer ICE in vect_transform_reduction and __builtin_fmax

2023-01-30 Thread rsandifo at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=108608 rsandifo at gcc dot gnu.org changed: What|Removed |Added Status|NEW |ASSIGNED

[Bug target/108583] [13 Regression] wrong code with vector division by uint16 at -O2

2023-01-31 Thread rsandifo at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=108583 --- Comment #9 from rsandifo at gcc dot gnu.org --- Are we sure this is a vectoriser vs. C vectors thing? The equivalent vectoriser test: void __attribute__((noipa)) f (unsigned *v) { for (int i = 0; i < 4; ++i) v[i] /= 0x; } int

[Bug target/108583] [13 Regression] wrong code with vector division by uint16 at -O2

2023-01-31 Thread rsandifo at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=108583 --- Comment #13 from rsandifo at gcc dot gnu.org --- OK, hopefully I understand now. Sorry for being slow. But what specific constraints do we want to apply to the optimisation? (In reply to Tamar Christina from comment #3) > Right, so this

[Bug rtl-optimization/108508] [12/13 Regression] ICE in insert_def_after, at rtl-ssa/accesses.cc:622 since r12-4759-g95bb87b2458bfa

2023-02-01 Thread rsandifo at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=108508 rsandifo at gcc dot gnu.org changed: What|Removed |Added Status|NEW |ASSIGNED

[Bug rtl-optimization/108086] [11/12/13 Regression] internal compiler error: in set_accesses, at rtl-ssa/internals.inl:449

2023-02-02 Thread rsandifo at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=108086 --- Comment #16 from rsandifo at gcc dot gnu.org --- (In reply to Richard Biener from comment #13) > (In reply to Richard Biener from comment #12) > > A regression from GCC 10 which compiles this in 90s at -O1. > > > > Richard? Can you

[Bug middle-end/108623] We need to grow the precision field in tree_type_common for PowerPC

2023-02-02 Thread rsandifo at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=108623 --- Comment #10 from rsandifo at gcc dot gnu.org --- (In reply to rguent...@suse.de from comment #9) > Please you do it, as far as I understand Richard S. no further adjustment > is necessary but we could simplify some code after the change(?)

[Bug target/109072] [12/13 Regression] SLP costs for vec duplicate too high since g:4963079769c99c4073adfd799885410ad484cbbe

2023-03-10 Thread rsandifo at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=109072 --- Comment #8 from rsandifo at gcc dot gnu.org --- (In reply to Richard Biener from comment #7) > Whatever you do with cost heuristics you'll find a testcase where that > regresses. Yep. That's the one true invariant of costing :-) I think

[Bug target/109072] [12/13 Regression] SLP costs for vec duplicate too high since g:4963079769c99c4073adfd799885410ad484cbbe

2023-03-09 Thread rsandifo at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=109072 rsandifo at gcc dot gnu.org changed: What|Removed |Added Assignee|unassigned at gcc dot gnu.org |rsandifo at gcc

[Bug target/109072] [12/13 Regression] SLP costs for vec duplicate too high since g:4963079769c99c4073adfd799885410ad484cbbe

2023-03-09 Thread rsandifo at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=109072 --- Comment #3 from rsandifo at gcc dot gnu.org --- (In reply to Tamar Christina from comment #2) > I thought the SLP algorithm was bottom up and stores were > already sinks? Yeah, they are. But the point is that we're vectorising the stores

[Bug target/109072] [12/13 Regression] SLP costs for vec duplicate too high since g:4963079769c99c4073adfd799885410ad484cbbe

2023-03-09 Thread rsandifo at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=109072 rsandifo at gcc dot gnu.org changed: What|Removed |Added Status|UNCONFIRMED |NEW Ever confirmed|0

[Bug target/109072] [12/13 Regression] SLP costs for vec duplicate too high since g:4963079769c99c4073adfd799885410ad484cbbe

2023-03-09 Thread rsandifo at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=109072 --- Comment #5 from rsandifo at gcc dot gnu.org --- Following an off-list discussion: maybe one option (for now) would be to make the aarch64 builtins lowering code look for vld1s whose arguments are ADDR_EXPRs of local VAR_DECLs (or maybe

[Bug rtl-optimization/108681] [12 Regression] gcc hangs compiling opencv/channels_combine.cpp for aarch64

2023-03-02 Thread rsandifo at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=108681 rsandifo at gcc dot gnu.org changed: What|Removed |Added Known to work|12.2.1 | --- Comment #14 from

[Bug tree-optimization/108430] [13 Regression] Wrong code with -msve-vector-bits=512 since r13-707-g68e0063397

2023-03-02 Thread rsandifo at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=108430 --- Comment #4 from rsandifo at gcc dot gnu.org --- Fixed on trunk, but the underlying bug is present in GCC 12 too.

[Bug tree-optimization/108979] [13 Regression] ICE in compute_live_loop_exits, at tree-ssa-loop-manip.cc:248

2023-03-02 Thread rsandifo at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=108979 rsandifo at gcc dot gnu.org changed: What|Removed |Added Status|NEW |ASSIGNED

[Bug tree-optimization/108979] [13 Regression] ICE in compute_live_loop_exits, at tree-ssa-loop-manip.cc:248

2023-03-02 Thread rsandifo at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=108979 rsandifo at gcc dot gnu.org changed: What|Removed |Added Resolution|--- |FIXED

[Bug rtl-optimization/106594] [13 Regression] sign-extensions no longer merged into addressing mode

2023-03-06 Thread rsandifo at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=106594 --- Comment #19 from rsandifo at gcc dot gnu.org --- I completely agree with comment 18. See: https://gcc.gnu.org/pipermail/gcc-patches/2022-September/601472.html for some more on a similar theme. make_compound_operation_int already has code

[Bug target/109535] internal compiler error: in finalize_new_accesses, at rtl-ssa/changes.cc:471

2023-04-17 Thread rsandifo at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=109535 --- Comment #2 from rsandifo at gcc dot gnu.org --- The assert in question fires if the pass creates an instruction whose pattern uses a register or memory and if the pass doesn't provide associated use information. Let me know if it looks

[Bug middle-end/109505] (t | 15) & svcntb() causes an OOM/ICE

2023-04-17 Thread rsandifo at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=109505 --- Comment #10 from rsandifo at gcc dot gnu.org --- Might be a daft question, but which cases besides INTEGER_CST are supposed to be captured by the CONSTANT_CLASS_P?

[Bug middle-end/109505] (t | 15) & svcntb() causes an OOM/ICE

2023-04-17 Thread rsandifo at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=109505 --- Comment #12 from rsandifo at gcc dot gnu.org --- (In reply to Andrew Pinski from comment #11) > For bit_and/bit_ior, VECTOR_CST (I would assume). Ah, yeah. But then I don't think a top-level POLY_INT_CST_P cuts it. We'd have the same

[Bug middle-end/109505] (t | 15) & svcntb() causes an OOM/ICE

2023-04-17 Thread rsandifo at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=109505 --- Comment #15 from rsandifo at gcc dot gnu.org --- (In reply to Jakub Jelinek from comment #13) > (In reply to rsand...@gcc.gnu.org from comment #12) > > (In reply to Andrew Pinski from comment #11) > > > For bit_and/bit_ior, VECTOR_CST (I

[Bug target/109535] internal compiler error: in finalize_new_accesses, at rtl-ssa/changes.cc:471

2023-04-17 Thread rsandifo at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=109535 --- Comment #8 from rsandifo at gcc dot gnu.org --- Can you quote a dump of the new insn pattern? Or just: dump(change); in function_info::finalize_new_accesses should do.

[Bug target/108910] [12 Regression] Further ICE in aarch64_layout_arg

2023-04-17 Thread rsandifo at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=108910 rsandifo at gcc dot gnu.org changed: What|Removed |Added Resolution|--- |FIXED

[Bug target/109632] Inefficient codegen when complex numbers are emulated with structs

2023-04-27 Thread rsandifo at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=109632 rsandifo at gcc dot gnu.org changed: What|Removed |Added CC||rsandifo at gcc dot

[Bug target/109632] Inefficient codegen when complex numbers are emulated with structs

2023-04-27 Thread rsandifo at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=109632 --- Comment #5 from rsandifo at gcc dot gnu.org --- Created attachment 54941 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=54941=edit hacky proof-of-concept patch This is a very hacky proof of concept patch. Don't try it on anything

[Bug target/109632] Inefficient codegen when complex numbers are emulated with structs

2023-04-27 Thread rsandifo at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=109632 --- Comment #7 from rsandifo at gcc dot gnu.org --- Thinking more about it, it would probably be better to defer the split until around lower_complex time, after IPA (especially inlining), NRV and tail-recursion. Doing it there should also

[Bug target/109632] Inefficient codegen when complex numbers are emulated with structs

2023-04-27 Thread rsandifo at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=109632 rsandifo at gcc dot gnu.org changed: What|Removed |Added Status|UNCONFIRMED |ASSIGNED Last

[Bug target/109636] [14 Regression] ICE: in paradoxical_subreg_p, at rtl.h:3205 with -O -mcpu=a64fx

2023-04-28 Thread rsandifo at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=109636 --- Comment #6 from rsandifo at gcc dot gnu.org --- Ugh. I guess we've got no option but to force the original subreg into a fresh register, but that's going to pessimise cases where arithmetic is done on tuple types. Perhaps we should just

[Bug target/109610] [14 regression] gcc.target/powerpc/dform-3.c fails after r14-172-g0368d169492017

2023-04-26 Thread rsandifo at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=109610 rsandifo at gcc dot gnu.org changed: What|Removed |Added CC||rsandifo at gcc dot

[Bug bootstrap/109510] [13 Regression] bootstrap with Ada broken on aarch64

2023-04-14 Thread rsandifo at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=109510 --- Comment #8 from rsandifo at gcc dot gnu.org --- (In reply to Jakub Jelinek from comment #7) > In patch form what I wrote above (completely untested): Sorry in advance for the overly verbose comment, but the timeline here was that: -

[Bug target/109661] [13/14 Regression] ICE in aarch64_function_arg_alignment when building erlang

2023-04-28 Thread rsandifo at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=109661 rsandifo at gcc dot gnu.org changed: What|Removed |Added Assignee|unassigned at gcc dot gnu.org |rsandifo at gcc

[Bug target/109661] [13/14 Regression] ICE in aarch64_function_arg_alignment when building erlang

2023-04-28 Thread rsandifo at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=109661 --- Comment #4 from rsandifo at gcc dot gnu.org --- Yeah, I'd come up with essentially the same fix locally. Was just trying to see whether it is an ABI problem. And I think it is. For: typedef unsigned long U __attribute__ ((aligned

[Bug target/109661] [13/14 Regression] ICE in aarch64_function_arg_alignment when building erlang

2023-05-02 Thread rsandifo at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=109661 --- Comment #6 from rsandifo at gcc dot gnu.org --- I'm working on a patch (though not literally right now).

[Bug target/109632] Inefficient codegen when complex numbers are emulated with structs

2023-05-02 Thread rsandifo at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=109632 --- Comment #10 from rsandifo at gcc dot gnu.org --- After prototyping this further, I no longer think that lowering at the gimple level is the best answer. (I should have listened to Richi.) Although it works, its major drawback is that

[Bug bootstrap/109510] [13 Regression] bootstrap with Ada broken on aarch64

2023-04-14 Thread rsandifo at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=109510 --- Comment #12 from rsandifo at gcc dot gnu.org --- (In reply to Eric Botcazou from comment #10) > Created attachment 54859 [details] > Tentative fix > > To be tested. Thanks! This works for me locally and gives clean Ada test results.

[Bug rtl-optimization/108508] [11 Backport] ICE in insert_def_after, at rtl-ssa/accesses.cc:622 since r12-4759-g95bb87b2458bfa

2023-04-03 Thread rsandifo at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=108508 rsandifo at gcc dot gnu.org changed: What|Removed |Added Summary|[12 Regression] ICE in |[11 Backport] ICE in

[Bug tree-optimization/108430] [12 Regression] Wrong code with -msve-vector-bits=512 since r13-707-g68e0063397

2023-04-03 Thread rsandifo at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=108430 rsandifo at gcc dot gnu.org changed: What|Removed |Added Resolution|--- |FIXED

[Bug target/109072] [12 Regression] SLP costs for vec duplicate too high since g:4963079769c99c4073adfd799885410ad484cbbe

2023-04-03 Thread rsandifo at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=109072 rsandifo at gcc dot gnu.org changed: What|Removed |Added Resolution|--- |FIXED

[Bug tree-optimization/108608] [12 Regression] AArch64 vectorizer ICE in vect_transform_reduction and __builtin_fmax

2023-04-03 Thread rsandifo at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=108608 rsandifo at gcc dot gnu.org changed: What|Removed |Added Status|ASSIGNED|RESOLVED

[Bug target/108603] [12 Regression] ICE in output_operand_lossage with SVE and ilp32

2023-04-03 Thread rsandifo at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=108603 rsandifo at gcc dot gnu.org changed: What|Removed |Added Status|ASSIGNED|RESOLVED

[Bug rtl-optimization/108681] [12 Regression] gcc hangs compiling opencv/channels_combine.cpp for aarch64

2023-04-03 Thread rsandifo at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=108681 rsandifo at gcc dot gnu.org changed: What|Removed |Added Status|REOPENED|RESOLVED

[Bug target/109436] AArch64: suboptimal codegen in 128 bit constant stores

2023-04-11 Thread rsandifo at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=109436 rsandifo at gcc dot gnu.org changed: What|Removed |Added CC||rsandifo at gcc dot

[Bug c/24542] potential unwanted truncation of operation overflow should be warned on assignment to wider variable

2023-03-31 Thread rsandifo at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=24542 rsandifo at gcc dot gnu.org changed: What|Removed |Added CC||rsandifo at gcc dot

[Bug target/109072] [12 Regression] SLP costs for vec duplicate too high since g:4963079769c99c4073adfd799885410ad484cbbe

2023-03-28 Thread rsandifo at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=109072 rsandifo at gcc dot gnu.org changed: What|Removed |Added Summary|[12/13 Regression] SLP |[12 Regression] SLP costs

[Bug target/109498] New: SVE support for ctz

2023-04-13 Thread rsandifo at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=109498 Bug ID: 109498 Summary: SVE support for ctz Product: gcc Version: 13.0 Status: UNCONFIRMED Severity: enhancement Priority: P3 Component: target

[Bug target/109499] Unnecessary zeroing in SVE loops

2023-04-13 Thread rsandifo at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=109499 --- Comment #4 from rsandifo at gcc dot gnu.org --- (In reply to rguent...@suse.de from comment #3) > AVX512 masking allows merge and zero modes, zero being cheaper > (obviously). I think "zero" is what all targets support so we could >

[Bug target/108910] [12 Regression] Further ICE in aarch64_layout_arg

2023-04-13 Thread rsandifo at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=108910 rsandifo at gcc dot gnu.org changed: What|Removed |Added Summary|[12/13 Regression] Further |[12 Regression] Further

[Bug tree-optimization/109269] [sve] should check the upper bound for predicate sve

2023-04-13 Thread rsandifo at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=109269 rsandifo at gcc dot gnu.org changed: What|Removed |Added Ever confirmed|0 |1

[Bug target/109499] New: Unnecessary zeroing in SVE loops

2023-04-13 Thread rsandifo at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=109499 Bug ID: 109499 Summary: Unnecessary zeroing in SVE loops Product: gcc Version: 13.0 Status: UNCONFIRMED Severity: enhancement Priority: P3 Component: target

[Bug target/109499] Unnecessary zeroing in SVE loops

2023-04-13 Thread rsandifo at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=109499 --- Comment #2 from rsandifo at gcc dot gnu.org --- (In reply to Richard Biener from comment #1) > Is there not enough info to catch this on the RTL level with a peephole? That works for simple cases like the first loop. But in general, I

[Bug rtl-optimization/108681] [13 Regression] gcc hangs compiling opencv/channels_combine.cpp for aarch64

2023-02-06 Thread rsandifo at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=108681 rsandifo at gcc dot gnu.org changed: What|Removed |Added Assignee|unassigned at gcc dot gnu.org |rsandifo at gcc

[Bug target/108316] [13 Regression] ICE in maybe_gen_insn via expand_SCATTER_STORE when vectorizing for SVE since r13-2737-g4a773bf2f08656

2023-02-07 Thread rsandifo at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=108316 rsandifo at gcc dot gnu.org changed: What|Removed |Added Status|NEW |ASSIGNED

[Bug rtl-optimization/105383] [11/12/13 Regression] ICE: in change_insns, at rtl-ssa/changes.cc:674 with -Os -m32 since r11-6188-g0b76990a9d75d97b

2023-02-09 Thread rsandifo at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=105383 rsandifo at gcc dot gnu.org changed: What|Removed |Added Assignee|unassigned at gcc dot gnu.org |rsandifo at gcc

[Bug target/108603] [12/13 Regression] ICE in output_operand_lossage with SVE and ilp32

2023-02-09 Thread rsandifo at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=108603 rsandifo at gcc dot gnu.org changed: What|Removed |Added Last reconfirmed||2023-02-09

[Bug tree-optimization/108430] [13 Regression] Wrong code with -msve-vector-bits=512 since r13-707-g68e0063397

2023-02-09 Thread rsandifo at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=108430 rsandifo at gcc dot gnu.org changed: What|Removed |Added CC||rsandifo at gcc dot

[Bug tree-optimization/108750] New: Loop unswitching fails for poly_int conditions

2023-02-10 Thread rsandifo at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=108750 Bug ID: 108750 Summary: Loop unswitching fails for poly_int conditions Product: gcc Version: 13.0 Status: UNCONFIRMED Severity: normal Priority: P3 Component:

[Bug tree-optimization/108748] New: Enhancement: track ranges of poly_int indeterminates

2023-02-10 Thread rsandifo at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=108748 Bug ID: 108748 Summary: Enhancement: track ranges of poly_int indeterminates Product: gcc Version: 13.0 Status: UNCONFIRMED Severity: enhancement Priority: P3

[Bug rtl-optimization/108681] [13 Regression] gcc hangs compiling opencv/channels_combine.cpp for aarch64

2023-02-13 Thread rsandifo at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=108681 rsandifo at gcc dot gnu.org changed: What|Removed |Added Status|ASSIGNED|RESOLVED

[Bug middle-end/108623] We need to grow the precision field in tree_type_common for PowerPC

2023-02-01 Thread rsandifo at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=108623 --- Comment #3 from rsandifo at gcc dot gnu.org --- The explanation is in the SET_TYPE_VECTOR_SUBPARTS code: /* We have two coefficients that are each in the range 1 << [0, 63], so supporting all combinations would require 6

[Bug tree-optimization/108748] Enhancement: track ranges of poly_int indeterminates

2023-02-10 Thread rsandifo at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=108748 --- Comment #2 from rsandifo at gcc dot gnu.org --- (In reply to Richard Biener from comment #1) > That said, we'd track a "virtual" variables range here. For the above > I wonder why we cannot constant fold it - [16, 16] can never be 2, no?

[Bug tree-optimization/110248] ivopts could under-cost for some addressing modes on len_{load,store}

2023-06-14 Thread rsandifo at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=110248 --- Comment #5 from rsandifo at gcc dot gnu.org --- ivopts does have code to treat ifn pointer arguments specially, see get_mem_type_for_internal_fn But like Kewen says, it's still only based on the mode. Personally I'd prefer an

[Bug testsuite/70150] Additonal test failures with --enable-default-pie

2023-06-15 Thread rsandifo at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=70150 --- Comment #32 from rsandifo at gcc dot gnu.org --- (In reply to Xi Ruoyao from comment #31) > Richard: is it allowed to backport them (or the entire > https://gcc.gnu.org/pipermail/gcc-patches/2023-March/613093.html series) for > gcc-12?

[Bug tree-optimization/110485] vectorizing simd clone calls without loop masking applied

2023-07-02 Thread rsandifo at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=110485 rsandifo at gcc dot gnu.org changed: What|Removed |Added CC||avieira at gcc dot

[Bug middle-end/110495] fre introduces signed wrap for vector

2023-07-03 Thread rsandifo at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=110495 --- Comment #2 from rsandifo at gcc dot gnu.org --- The point of the builder is that, if you know the pattern, you don't need to supply every element value to the builder. (And indeed you can't when the vector is variable length.) So

[Bug target/110625] [AArch64] Vect: SLP fails to vectorize a loop as the reduction_latency calculated by new costs is too large

2023-07-18 Thread rsandifo at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=110625 rsandifo at gcc dot gnu.org changed: What|Removed |Added Status|UNCONFIRMED |ASSIGNED

[Bug target/110625] [AArch64] Vect: SLP fails to vectorize a loop as the reduction_latency calculated by new costs is too large

2023-07-18 Thread rsandifo at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=110625 rsandifo at gcc dot gnu.org changed: What|Removed |Added CC||rsandifo at gcc dot

[Bug target/110625] [AArch64] Vect: SLP fails to vectorize a loop as the reduction_latency calculated by new costs is too large

2023-07-19 Thread rsandifo at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=110625 --- Comment #7 from rsandifo at gcc dot gnu.org --- The current issue rate framework was originally written for Neoverse V1 and Neoverse V2. For those cores, it wasn't necessary to make a distinction between scalar integer operations and

[Bug target/110751] RISC-V: Suport undefined value that allows VSETVL PASS use TA/MA

2023-07-20 Thread rsandifo at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=110751 rsandifo at gcc dot gnu.org changed: What|Removed |Added CC||rsandifo at gcc dot

[Bug target/110039] [14 Regression] FAIL: gcc.target/aarch64/rev16_2.c scan-assembler-times rev16\\tw[0-9]+ 2

2023-05-31 Thread rsandifo at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=110039 rsandifo at gcc dot gnu.org changed: What|Removed |Added CC||rsandifo at gcc dot

[Bug rtl-optimization/109940] [14 Regression] ICE in decide_candidate_validity since g:53dddbfeb213ac4ec39f550aa81eaa4264375d2c

2023-05-24 Thread rsandifo at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=109940 rsandifo at gcc dot gnu.org changed: What|Removed |Added Status|ASSIGNED|RESOLVED

[Bug target/109964] auto-vectorization of shift ignores integral promotions

2023-05-25 Thread rsandifo at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=109964 --- Comment #4 from rsandifo at gcc dot gnu.org --- (In reply to Richard Biener from comment #3) > So the bug in the vectorizer is that it does > > t.ii:14:5: note: can narrow to signed:16 without loss of precision: _31 = > 1 >> _30; >

[Bug rtl-optimization/109940] [14 Regression] ICE in decide_candidate_validity since g:53dddbfeb213ac4ec39f550aa81eaa4264375d2c

2023-05-23 Thread rsandifo at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=109940 rsandifo at gcc dot gnu.org changed: What|Removed |Added Assignee|unassigned at gcc dot gnu.org |rsandifo at gcc

[Bug target/109632] Inefficient codegen when complex numbers are emulated with structs

2023-05-23 Thread rsandifo at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=109632 --- Comment #12 from rsandifo at gcc dot gnu.org --- The patch in comment 11 is just a related spot improvement. The PR itself is still unfixed.

[Bug target/110105] ARM GCC: underoptimization: expected vfma.f16, actual vcvtb-vfma.f32-vcvtb

2023-06-09 Thread rsandifo at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=110105 rsandifo at gcc dot gnu.org changed: What|Removed |Added CC||rsandifo at gcc dot

[Bug target/109855] [14 Regression] ICE: in curr_insn_transform, at lra-constraints.cc:4231 unable to generate reloads for {aarch64_mlav4hi_vec_concatz_le} at -O1

2023-05-22 Thread rsandifo at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=109855 --- Comment #3 from rsandifo at gcc dot gnu.org --- Looking at the mddump file, the output predicate and constraint seem to have gone AWOL: ;; /home/ricsan01/gnu/src/gcc/gcc/config/aarch64/aarch64-simd.md: 1554 (define_insn

[Bug target/109855] [14 Regression] ICE: in curr_insn_transform, at lra-constraints.cc:4231 unable to generate reloads for {aarch64_mlav4hi_vec_concatz_le} at -O1

2023-05-22 Thread rsandifo at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=109855 --- Comment #4 from rsandifo at gcc dot gnu.org --- I guess the problem is that the define_subst output template has: (match_operand: 0) which creates a new operand 0 with an empty predicate and constraint, as opposed to a (match_dup 0),

[Bug target/110751] RISC-V: Suport undefined value that allows VSETVL PASS use TA/MA

2023-07-21 Thread rsandifo at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=110751 --- Comment #20 from rsandifo at gcc dot gnu.org --- (In reply to Richard Biener from comment #19) > Sure, I can kind of see the usefulness elsewhere. Just for this particular > issue it doesn't seem necessary to sit down and design this when

[Bug target/110751] RISC-V: Suport undefined value that allows VSETVL PASS use TA/MA

2023-07-21 Thread rsandifo at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=110751 --- Comment #18 from rsandifo at gcc dot gnu.org --- I'd understood LLVM's undef as essentially being “unspecified”, or “unspecified bit-pattern” to quote the docs. It doesn't indicate undefined behaviour in the C/C++ sense: Undefined

[Bug middle-end/106081] missed vectorization

2023-07-26 Thread rsandifo at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=106081 --- Comment #14 from rsandifo at gcc dot gnu.org --- FWIW, changing: if (!STMT_VINFO_GROUPED_ACCESS (dr_stmt)) continue; to: if (!STMT_VINFO_GROUPED_ACCESS (dr_stmt)) {

[Bug middle-end/106081] missed vectorization

2023-07-26 Thread rsandifo at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=106081 --- Comment #13 from rsandifo at gcc dot gnu.org --- (In reply to Richard Biener from comment #12) > Btw, I see we actually materialize a permute before the splat: > > t.c:14:24: note: node 0x5b311c0 (max_nunits=1, refcnt=2) vector(2)

[Bug middle-end/106081] missed vectorization

2023-07-26 Thread rsandifo at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=106081 --- Comment #10 from rsandifo at gcc dot gnu.org --- (In reply to Richard Biener from comment #9) > So I can adjust change_layout_cost in a bit awkward way, but it seems that > internal_node_cost would already work out that a permute can be

[Bug target/110625] [AArch64] Vect: SLP fails to vectorize a loop as the reduction_latency calculated by new costs is too large

2023-07-28 Thread rsandifo at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=110625 --- Comment #10 from rsandifo at gcc dot gnu.org --- Created attachment 55654 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=55654=edit Candidate patch (part 2) Sorry for the delay. I'm testing the attached two patches to fix the

[Bug target/110625] [AArch64] Vect: SLP fails to vectorize a loop as the reduction_latency calculated by new costs is too large

2023-07-28 Thread rsandifo at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=110625 --- Comment #9 from rsandifo at gcc dot gnu.org --- Created attachment 55653 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=55653=edit Candidate patch (part 1)

[Bug tree-optimization/110780] aarch64 NEON redundant displaced ld3

2023-07-24 Thread rsandifo at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=110780 rsandifo at gcc dot gnu.org changed: What|Removed |Added CC||rsandifo at gcc dot

[Bug tree-optimization/110897] RISC-V: Fail to vectorize shift

2023-08-04 Thread rsandifo at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=110897 --- Comment #12 from rsandifo at gcc dot gnu.org --- (In reply to JuzheZhong from comment #11) > You can see "_9 = _5 >> _8;". We should vectorize SImode instead of HImode. > The correct follow should be first extend HI -> SImode, Then

[Bug tree-optimization/110897] RISC-V: Fail to vectorize shift

2023-08-04 Thread rsandifo at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=110897 --- Comment #10 from rsandifo at gcc dot gnu.org --- (In reply to JuzheZhong from comment #9) > I seems that we must support widen shift pattern in RISCV port even though > we don't have widen shift instructions ? I doubt it. Seems like one

[Bug tree-optimization/110449] Vect: use a small step to calculate the loop induction if the loop is unrolled during loop vectorization

2023-06-28 Thread rsandifo at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=110449 rsandifo at gcc dot gnu.org changed: What|Removed |Added CC||rguenth at gcc dot

[Bug middle-end/106081] missed vectorization

2023-06-27 Thread rsandifo at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=106081 --- Comment #7 from rsandifo at gcc dot gnu.org --- I don't think the splat creates a new layout, but instead a splat should be allowed to change its layout at zero cost.

[Bug target/109661] [13 Regression] ICE in aarch64_function_arg_alignment when building erlang

2023-05-03 Thread rsandifo at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=109661 rsandifo at gcc dot gnu.org changed: What|Removed |Added Summary|[13/14 Regression] ICE in |[13 Regression] ICE in

[Bug target/109929] profiledbootstrap failure on aarch64-linux-gnu with graphite optimization

2024-01-22 Thread rsandifo at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=109929 --- Comment #7 from Richard Sandiford --- Hmm, yeah, like you say, neither of those commits should have made a different to whether bootstrap works. I guess the problem is just latent now.

[Bug target/113485] [14 regression] ICE with -fno-guess-branch-probability on aarch64 starting with r14-7187-g74e3e839ab2d36

2024-01-24 Thread rsandifo at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=113485 Richard Sandiford changed: What|Removed |Added Status|NEW |ASSIGNED --- Comment #7 from

[Bug target/113572] [14 Regression] aarch64: internal compiler error in aarch64_sve::vector_cst_all_same

2024-01-24 Thread rsandifo at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=113572 Richard Sandiford changed: What|Removed |Added Assignee|unassigned at gcc dot gnu.org |rsandifo at gcc dot gnu.org

[Bug tree-optimization/113281] [14 Regression] Wrong code due to vectorization of shift reduction and missing promotions since r14-3027

2024-01-24 Thread rsandifo at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=113281 Richard Sandiford changed: What|Removed |Added Assignee|unassigned at gcc dot gnu.org |rsandifo at gcc dot gnu.org

[Bug target/113763] [14 Regression] build fails with clang++ host compiler because aarch64.cc uses C++14 constexpr.

2024-02-06 Thread rsandifo at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=113763 --- Comment #11 from Richard Sandiford --- Currently away so can't try it myself, but how about just using an ad-hoc structure instead?

[Bug target/113763] [14 Regression] build fails with clang++ host compiler because aarch64.cc uses C++14 constexpr.

2024-02-06 Thread rsandifo at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=113763 --- Comment #14 from Richard Sandiford --- AFAIK, the constructor shouldn't be necessary. (And without it, the whole thing would fit on one line.) LGTM (and preapproved) otherwise. Thanks for doing this.

<    1   2   3   4   5   6   7   >