[Bug target/91708] [10 regression][ARM] Bootstrap fails in gen_movsi, at config/arm/arm.md:5258

2019-09-10 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=91708 --- Comment #13 from Wilco --- (In reply to Bernd Edlinger from comment #12) > Created attachment 46863 [details] > untested patch > > That was easy :-) > I have been there before... Great! That bootstraps successfully now.

[Bug target/91738] [10 regression] gcc.target/arm/pr53447-5.c fails since r274823

2019-09-11 Thread wilco at gcc dot gnu.org
||2019-09-11 CC||wilco at gcc dot gnu.org Assignee|unassigned at gcc dot gnu.org |wilco at gcc dot gnu.org Ever confirmed|0 |1 --- Comment #1 from Wilco --- (In reply to Christophe Lyon from comment

[Bug middle-end/91753] New: Bad register allocation of multi-register types

2019-09-12 Thread wilco at gcc dot gnu.org
: middle-end Assignee: unassigned at gcc dot gnu.org Reporter: wilco at gcc dot gnu.org Target Milestone: --- The following example shows that register allocation of types which require multiple registers is quite non-optimal: #include #include void neon_transform_nada(const

[Bug middle-end/91753] Bad register allocation of multi-register types

2019-09-13 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=91753 --- Comment #4 from Wilco --- (In reply to Andrew Pinski from comment #3) > (In reply to Wilco from comment #2) > > (In reply to Andrew Pinski from comment #1) > > > lower-subreg should have be able to help here. I wonder why it did not > > > .

[Bug tree-optimization/91776] `-fsplit-paths` generates slower code on arm

2019-09-16 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=91776 Wilco changed: What|Removed |Added CC||wilco at gcc dot gnu.org --- Comment #1 from

[Bug target/91766] -fvisibility=hidden during -fpic still uses GOT indirection on arm64

2019-09-17 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=91766 Wilco changed: What|Removed |Added CC||wilco at gcc dot gnu.org --- Comment #7 from

[Bug target/91766] -fvisibility=hidden during -fpic still uses GOT indirection on arm64

2019-09-17 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=91766 Wilco changed: What|Removed |Added Status|UNCONFIRMED |NEW Last reconfirmed|

[Bug middle-end/91708] [10 regression][ARM] Bootstrap fails in gen_movsi, at config/arm/arm.md:5258

2019-09-17 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=91708 --- Comment #16 from Wilco --- (In reply to Richard Earnshaw from comment #15) > So is this now fixed? On trunk yes. This is quite a nasty alias bug in CSE, so it will need to be backported.

[Bug middle-end/91708] [10 regression][ARM] Bootstrap fails in gen_movsi, at config/arm/arm.md:5258

2019-09-17 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=91708 --- Comment #18 from Wilco --- (In reply to Richard Earnshaw from comment #17) > So do we have a testcase that shows the problem on older compilers? Yes, the same testcase shows the same incorrect substitution in older compilers. I tried GCC9, b

[Bug tree-optimization/91776] `-fsplit-paths` generates slower code on arm

2019-09-18 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=91776 Wilco changed: What|Removed |Added Status|UNCONFIRMED |NEW Last reconfirmed|

[Bug middle-end/91708] [10 regression][ARM] Bootstrap fails in gen_movsi, at config/arm/arm.md:5258

2019-09-18 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=91708 --- Comment #20 from Wilco --- (In reply to Richard Earnshaw from comment #19) > (In reply to Wilco from comment #18) > > (In reply to Richard Earnshaw from comment #17) > > > So do we have a testcase that shows the problem on older compilers? >

[Bug middle-end/91708] [10 regression][ARM] Bootstrap fails in gen_movsi, at config/arm/arm.md:5258

2019-09-18 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=91708 --- Comment #22 from Wilco --- (In reply to Richard Earnshaw from comment #21) > But dropping in a char* will give a more restrictive alias set, so that > isn't wrong, even if it is suboptimal The alias set could be anything given CSE changes on

[Bug target/91738] [10 regression] gcc.target/arm/pr53447-5.c fails since r274823

2019-09-18 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=91738 --- Comment #2 from Wilco --- Author: wilco Date: Wed Sep 18 19:52:09 2019 New Revision: 275907 URL: https://gcc.gnu.org/viewcvs?rev=275907&root=gcc&view=rev Log: [ARM] Add logical DImode expanders We currently use default mid-end expanders for

[Bug target/91738] [10 regression] gcc.target/arm/pr53447-5.c fails since r274823

2019-09-19 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=91738 Wilco changed: What|Removed |Added Status|ASSIGNED|RESOLVED Resolution|---

[Bug target/91833] [10 Regression] [AArch64] LSE atomics depends on glibc specific sys/auxv.h

2019-10-10 Thread wilco at gcc dot gnu.org
||wilco at gcc dot gnu.org Resolution|--- |FIXED --- Comment #7 from Wilco --- Fixed

[Bug target/91834] [10 Regression ] [AArch64] LSE atomics, warnings due to unpredictable behavior with strx and the same register for s and t

2019-10-10 Thread wilco at gcc dot gnu.org
||wilco at gcc dot gnu.org Resolution|--- |FIXED --- Comment #6 from Wilco --- Fixed

[Bug c++/92045] New: [7 8 9 10 regression][C++11] valid alignas ignored with spurious warning

2019-10-10 Thread wilco at gcc dot gnu.org
Priority: P3 Component: c++ Assignee: unassigned at gcc dot gnu.org Reporter: wilco at gcc dot gnu.org Target Milestone: --- Since GCC7 alignas no longer works correctly if the requested alignment is larger than MAX_STACK_ALIGNMENT (which is 8 or 16 on most

[Bug c++/89357] [7/8/9/10 regression][C++11] alignas for automatic variables with alignment greater than 16 fails

2019-10-10 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=89357 Wilco changed: What|Removed |Added Summary|alignas for automatic |[7/8/9/10 |variables with alig

[Bug bootstrap/91034] In tree build of gmp fails on Raspberry Pi4 (ARM Cortex A72) with `mls r1,r4,r8,r11' not supported in ARM mode

2019-10-10 Thread wilco at gcc dot gnu.org
||wilco at gcc dot gnu.org Resolution|--- |INVALID --- Comment #17 from Wilco --- Not a GCC bug, just a gmp configuration oddity.

[Bug tree-optimization/88760] GCC unrolling is suboptimal

2019-10-11 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=88760 --- Comment #29 from Wilco --- (In reply to Jiu Fu Guo from comment #28) > For these kind of small loops, it would be acceptable to unroll in GIMPLE, > because register pressure and instruction cost may not be major concerns; > just like "cunrol

[Bug tree-optimization/88760] GCC unrolling is suboptimal

2019-10-11 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=88760 --- Comment #32 from Wilco --- (In reply to Segher Boessenkool from comment #31) > Gimple passes know a lot about machine details, too. > > Irrespective of if this is "low-level" or "high-level", it should be done > earlier than it is now. It s

[Bug tree-optimization/88760] GCC unrolling is suboptimal

2019-10-11 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=88760 --- Comment #34 from Wilco --- (In reply to rguent...@suse.de from comment #30) > On Fri, 11 Oct 2019, wilco at gcc dot gnu.org wrote: > > > https://gcc.gnu.org/bugzilla/show_bug.cgi?id=88760 > > > > --- Comment #29

[Bug rtl-optimization/42575] arm-eabi-gcc 64-bit multiply weirdness

2019-10-11 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=42575 Wilco changed: What|Removed |Added Status|REOPENED|RESOLVED Resolution|---

[Bug target/92075] extracting element from NEON float-vector moves to/from integer register

2019-10-14 Thread wilco at gcc dot gnu.org
||2019-10-14 CC||wilco at gcc dot gnu.org Ever confirmed|0 |1 --- Comment #1 from Wilco --- Confirmed. AArch64 gets this right so we should emit efficient code on Arm too.

[Bug target/92071] [10 regression][ARM] ice in gen_movsi, at config/arm/arm.md:5378

2019-10-16 Thread wilco at gcc dot gnu.org
||2019-10-16 CC||wilco at gcc dot gnu.org Target Milestone|--- |10.0 Summary|ice in gen_movsi, at|[10 regression][ARM] ice in |config/arm/arm.md:5378 |gen_movsi, at

[Bug target/81356] __builtin_strcpy is not good for copying an empty string on aarch64

2017-12-15 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=81356 Wilco changed: What|Removed |Added Status|NEW |RESOLVED Resolution|---

[Bug tree-optimization/83491] [8 Regression] ICE in execute_cse_reciprocals_1 at gcc/tree-ssa-math-opts.c:585

2017-12-19 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=83491 Wilco changed: What|Removed |Added Status|UNCONFIRMED |NEW Last reconfirmed|

[Bug tree-optimization/83491] [8 Regression] ICE in execute_cse_reciprocals_1 at gcc/tree-ssa-math-opts.c:585

2017-12-19 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=83491 --- Comment #3 from Wilco --- (In reply to Jakub Jelinek from comment #2) > There are multiple bugs: > 1) the callers of execute_cse_reciprocals_1 ensure that def is SSA_NAME, so > using: > /* If this is a square (x * x), we should check whethe

[Bug tree-optimization/83491] [8 Regression] ICE in execute_cse_reciprocals_1 at gcc/tree-ssa-math-opts.c:585

2017-12-20 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=83491 --- Comment #6 from Wilco --- Patch: https://gcc.gnu.org/ml/gcc-patches/2017-12/msg01357.html

[Bug tree-optimization/80724] [8 Regression] gcc.target/aarch64/pr62178.c failed because of r247885

2018-01-03 Thread wilco at gcc dot gnu.org
||wilco at gcc dot gnu.org Resolution|--- |FIXED --- Comment #3 from Wilco --- Latest trunk generates: .L7: ldr s2, [x1, 4]! ldr q1, [x0], 124 mla v0.4s, v1.4s, v2.s[0] cmp x0, x2 bne .L7 So

[Bug target/82439] Missing (x | y) == x simplifications

2018-01-05 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=82439 Wilco changed: What|Removed |Added Status|NEW |RESOLVED Resolution|---

[Bug target/83726] [8 Regression] ICE: in final_scan_insn, at final.c:3063

2018-01-08 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=83726 --- Comment #3 from Wilco --- This is related with PR82964/82974, looks like same underlying issue. I have a patch which changes the constraint, and that fixes this issue too. It's not obvious to me whether legitimate_constant_p should be a subse

[Bug target/82214] [AArch64] Incorrect checking of LDP/STP offsets in aarch64_print_operand

2018-01-09 Thread wilco at gcc dot gnu.org
||2018-01-09 CC||wilco at gcc dot gnu.org Ever confirmed|0 |1 --- Comment #1 from Wilco --- Note r255230 introduces 'y' and 'z' operand printing specifiers to solve this issue, however they are no

[Bug target/83726] [8 Regression] ICE: in final_scan_insn, at final.c:3063

2018-01-10 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=83726 --- Comment #6 from Wilco --- (In reply to Steve Ellcey from comment #4) > This looks like the same thing as PR 83632. It likely is, I don't see it failing with my patch. Older compilers don't appear to like the Fortran syntax...

[Bug fortran/82392] Allign arrays for faster execution

2018-01-10 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=82392 Wilco changed: What|Removed |Added CC||wilco at gcc dot gnu.org --- Comment #3 from

[Bug target/79041] aarch64 backend emits R_AARCH64_ADR_PREL_PG_HI21 relocation despite -mpc-relative-literal-loads option being used

2018-01-17 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=79041 --- Comment #15 from Wilco --- Author: wilco Date: Wed Jan 17 16:31:42 2018 New Revision: 256800 URL: https://gcc.gnu.org/viewcvs?rev=256800&root=gcc&view=rev Log: [AArch64] PR82964: Fix 128-bit immediate ICEs This fixes PR82964 which reports I

[Bug target/82964] gfortran.dg/class_array_1.f03 regression since r254388

2018-01-17 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=82964 --- Comment #2 from Wilco --- Author: wilco Date: Wed Jan 17 16:31:42 2018 New Revision: 256800 URL: https://gcc.gnu.org/viewcvs?rev=256800&root=gcc&view=rev Log: [AArch64] PR82964: Fix 128-bit immediate ICEs This fixes PR82964 which reports IC

[Bug target/83632] pdt_26.f03:29:0: internal compiler error: in final_scan_insn, at final.c:3063

2018-01-17 Thread wilco at gcc dot gnu.org
||wilco at gcc dot gnu.org Resolution|--- |FIXED --- Comment #3 from Wilco --- (In reply to Andreas Schwab from comment #0) > $ gcc/gfortran -Bgcc/ ../../gcc/gcc/testsuite/gfortran.dg/pdt_26.f03 > -mabi=lp64 -O3 -g -S Fixed by r256800

[Bug target/83726] [8 Regression] ICE: in final_scan_insn, at final.c:3063

2018-01-17 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=83726 Wilco changed: What|Removed |Added Status|NEW |RESOLVED Resolution|---

[Bug target/82964] gfortran.dg/class_array_1.f03 regression since r254388

2018-01-17 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=82964 Wilco changed: What|Removed |Added Status|ASSIGNED|RESOLVED Resolution|---

[Bug target/82964] gfortran.dg/class_array_1.f03 regression since r254388

2018-01-18 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=82964 --- Comment #5 from Wilco --- Author: wilco Date: Thu Jan 18 16:37:44 2018 New Revision: 256854 URL: https://gcc.gnu.org/viewcvs?rev=256854&root=gcc&view=rev Log: [AArch64] Fix fp16 test failures after PR82964 fix This fixes test failures in gc

[Bug middle-end/81657] [8 Regression] FAIL: gcc.dg/20050503-1.c scan-assembler-not call

2018-01-18 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=81657 Wilco changed: What|Removed |Added CC||wilco at gcc dot gnu.org --- Comment #5 from

[Bug middle-end/81657] [8 Regression] FAIL: gcc.dg/20050503-1.c scan-assembler-not call

2018-01-18 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=81657 --- Comment #7 from Wilco --- (In reply to H.J. Lu from comment #6) > (In reply to Wilco from comment #5) > > Note there are other optimizations which can block a tailcall, for example: > > > > void *f (void *p) { return __builtin_strchr (p, 0);

[Bug middle-end/81657] [8 Regression] FAIL: gcc.dg/20050503-1.c scan-assembler-not call

2018-01-18 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=81657 --- Comment #9 from Wilco --- (In reply to Jakub Jelinek from comment #8) > That just means r240568 caused another regression. > Again, on various targets strchr is efficient, just on a few ones it is not > and the change was unfortunately done g

[Bug middle-end/81657] [8 Regression] FAIL: gcc.dg/20050503-1.c scan-assembler-not call

2018-01-18 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=81657 --- Comment #12 from Wilco --- (In reply to H.J. Lu from comment #10) > (In reply to Wilco from comment #9) > > (In reply to Jakub Jelinek from comment #8) > > > That just means r240568 caused another regression. > > > Again, on various targets s

[Bug middle-end/81657] [8 Regression] FAIL: gcc.dg/20050503-1.c scan-assembler-not call

2018-01-19 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=81657 --- Comment #15 from Wilco --- (In reply to H.J. Lu from comment #13) > (In reply to Wilco from comment #12) > > > > > > > Do you have data to show that? > > > > Yes, on x64 I get these timings for a simple function containing just the > > lib

[Bug middle-end/81657] [8 Regression] FAIL: gcc.dg/20050503-1.c scan-assembler-not call

2018-01-19 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=81657 --- Comment #14 from Wilco --- (In reply to Jakub Jelinek from comment #11) > No matter what, I don't see how you could use much common infrastructure > here. > Say if the tailcall pass sees strlen (something) + something being returned, > it cou

[Bug middle-end/81657] [8 Regression] FAIL: gcc.dg/20050503-1.c scan-assembler-not call

2018-01-19 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=81657 --- Comment #17 from Wilco --- (In reply to Jakub Jelinek from comment #16) > (In reply to Wilco from comment #15) > > I don't think it's safe to compare different benchmark results like that. > > But yes the kernel for both should be very simila

[Bug target/79262] [6/7/8 Regression] load gap with store gap causing performance regression in 462.libquantum

2018-01-19 Thread wilco at gcc dot gnu.org
||2018-01-19 CC||wilco at gcc dot gnu.org Ever confirmed|0 |1 --- Comment #5 from Wilco --- Commit related to this: commit a1b0b75cfc4f82162fc7e9b1f7255c98359a1078 Author: pinskia Date: Wed Feb 1 18:30:50 2017 +

[Bug middle-end/78809] Inline strcmp with small constant strings

2018-01-19 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=78809 --- Comment #25 from Wilco --- (In reply to Qing Zhao from comment #24) > From the above, we can see: > even with n is as big as 20, inlined version is much faster than the > non-inlined version, both on aarch64 (no hardware string compare in

[Bug middle-end/84029] New: Partially inline strcmp

2018-01-24 Thread wilco at gcc dot gnu.org
Assignee: unassigned at gcc dot gnu.org Reporter: wilco at gcc dot gnu.org Target Milestone: --- Initial performance results for PR78809 suggest that partial inlining of strcmp may be beneficial. We could inline the first character comparison before calling strcmp: if ((res = s[0] - t[0

[Bug middle-end/84029] Partially inline strcmp

2018-01-25 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=84029 --- Comment #2 from Wilco --- (In reply to Richard Biener from comment #1) > If we know the length of either argument and the other argument is properly > aligned we can do a 2, 4 or 8 byte compare upfront. Not sure how often that > happens (esp

[Bug middle-end/78809] Inline strcmp with small constant strings

2018-01-25 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=78809 --- Comment #29 from Wilco --- (In reply to Qing Zhao from comment #28) > > > > I don't think this is a good test. Repeatedly calling strcmp with the same > > inputs is not something real code does, especially when the string matches > > exactl

[Bug middle-end/84071] New: [7/8 regression] nonzero_bits1 of subreg incorrect

2018-01-26 Thread wilco at gcc dot gnu.org
: middle-end Assignee: unassigned at gcc dot gnu.org Reporter: wilco at gcc dot gnu.org Target Milestone: --- PR59461 changed nonzero_bits1 incorrectly for subregs: /* On many CISC machines, accessing an object in a wider mode causes the high-order bits

[Bug middle-end/78809] Inline strcmp with small constant strings

2018-01-29 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=78809 --- Comment #31 from Wilco --- (In reply to Qing Zhao from comment #30) > (in reply to Wilco from comment #29) > > > > The new test is better, however it uses i % 15 which means an expensive > > division by constant every loop iteration. It's be

[Bug middle-end/84071] [7/8 regression] nonzero_bits1 of subreg incorrect

2018-01-29 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=84071 --- Comment #5 from Wilco --- (In reply to Eric Botcazou from comment #3) > > PR59461 changed nonzero_bits1 incorrectly for subregs: > > > > /* On many CISC machines, accessing an object in a wider mode > > causes the high

[Bug rtl-optimization/84068] [8 Regression] ICE: qsort checking failed: qsort comparator non-negative on sorted output: 1 with -fno-sched-critical-path-heuristic --param=max-sched-extend-regions-iters

2018-01-30 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=84068 Wilco changed: What|Removed |Added Assignee|unassigned at gcc dot gnu.org |wilco at gcc dot gnu.org --- Comment #3

[Bug rtl-optimization/84071] [7/8 regression] wrong elimination of zero-extension after sign-extended load

2018-01-31 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=84071 --- Comment #15 from Wilco --- (In reply to Eric Botcazou from comment #10) > > The addition is performed on the full 32-bit register, so this obviously > > means that the top 24 bits have an undefined value. > > Not if the entire registers have

[Bug rtl-optimization/84071] [7/8 regression] wrong elimination of zero-extension after sign-extended load

2018-01-31 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=84071 --- Comment #19 from Wilco --- (In reply to Eric Botcazou from comment #16) > > Also I wonder whether this means AArch64 should set it since targets like > > MIPS > > and Sparc already set it. > > There seems to be a good reason against that:

[Bug rtl-optimization/83459] [8 Regression] ICE: qsort checking failed: qsort comparator non-negative on sorted output: 1 with --param=sched-pressure-algorithm=2

2018-02-08 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=83459 --- Comment #3 from Wilco --- Author: wilco Date: Thu Feb 8 12:29:28 2018 New Revision: 257481 URL: https://gcc.gnu.org/viewcvs?rev=257481&root=gcc&view=rev Log: PR84068, PR83459: Fix sort order of SCHED_PRESSURE_MODEL The comparison function

[Bug rtl-optimization/84068] [8 Regression] ICE: qsort checking failed: qsort comparator non-negative on sorted output: 1 with -fno-sched-critical-path-heuristic --param=max-sched-extend-regions-iters

2018-02-08 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=84068 --- Comment #5 from Wilco --- Author: wilco Date: Thu Feb 8 12:29:28 2018 New Revision: 257481 URL: https://gcc.gnu.org/viewcvs?rev=257481&root=gcc&view=rev Log: PR84068, PR83459: Fix sort order of SCHED_PRESSURE_MODEL The comparison function

[Bug rtl-optimization/84068] [8 Regression] ICE: qsort checking failed: qsort comparator non-negative on sorted output: 1 with -fno-sched-critical-path-heuristic --param=max-sched-extend-regions-iters

2018-02-08 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=84068 Wilco changed: What|Removed |Added Status|NEW |RESOLVED Resolution|---

[Bug middle-end/82407] [meta-bug] qsort_chk fallout tracking

2018-02-08 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=82407 Bug 82407 depends on bug 83459, which changed state. Bug 83459 Summary: [8 Regression] ICE: qsort checking failed: qsort comparator non-negative on sorted output: 1 with --param=sched-pressure-algorithm=2 https://gcc.gnu.org/bugzilla/show_bug.cgi

[Bug rtl-optimization/83459] [8 Regression] ICE: qsort checking failed: qsort comparator non-negative on sorted output: 1 with --param=sched-pressure-algorithm=2

2018-02-08 Thread wilco at gcc dot gnu.org
||wilco at gcc dot gnu.org Resolution|--- |FIXED --- Comment #4 from Wilco --- Fixed in r257481.

[Bug target/89222] [7.x regression] ARM thumb-2 misoptimisation of func ptr call with -O2 or -Os

2019-02-08 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=89222 Wilco changed: What|Removed |Added CC||wilco at gcc dot gnu.org --- Comment #5 from

[Bug target/89222] [7.x regression] ARM thumb-2 misoptimisation of func ptr call with -O2 or -Os

2019-02-08 Thread wilco at gcc dot gnu.org
||2019-02-08 Assignee|unassigned at gcc dot gnu.org |wilco at gcc dot gnu.org Ever confirmed|0 |1

[Bug target/89222] [7/8/9 regression] ARM thumb-2 misoptimisation of func ptr call with -O2 or -Os

2019-02-11 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=89222 --- Comment #7 from Wilco --- Patch: https://gcc.gnu.org/ml/gcc-patches/2019-02/msg00780.html

[Bug tree-optimization/86637] [9 Regression] ICE: tree check: expected block, have in inlining_chain_to_json, at optinfo-emit-json.cc:293

2019-02-11 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=86637 --- Comment #14 from Wilco --- Author: wilco Date: Mon Feb 11 18:14:37 2019 New Revision: 268777 URL: https://gcc.gnu.org/viewcvs?rev=268777&root=gcc&view=rev Log: [COMMITTED] Fix pthread errors in pr86637-2.c Fix test errors on targets which d

[Bug target/89222] [7/8/9 regression] ARM thumb-2 misoptimisation of func ptr call with -O2 or -Os

2019-02-13 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=89222 --- Comment #8 from Wilco --- (In reply to Wilco from comment #7) > Patch: https://gcc.gnu.org/ml/gcc-patches/2019-02/msg00780.html Updated patch: https://gcc.gnu.org/ml/gcc-patches/2019-02/msg00947.html

[Bug target/89190] [8/9 regression][ARM] armv8-m.base invalid ldm ICE

2019-02-13 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=89190 --- Comment #2 from Wilco --- Author: wilco Date: Wed Feb 13 16:22:25 2019 New Revision: 268848 URL: https://gcc.gnu.org/viewcvs?rev=268848&root=gcc&view=rev Log: [ARM] Fix Thumb-1 ldm (PR89190) This patch fixes an ICE in the Thumb-1 LDM peepho

[Bug target/85711] [8 regression] ICE in aarch64_classify_address, at config/aarch64/aarch64.c:5678

2019-02-15 Thread wilco at gcc dot gnu.org
||2019-02-15 CC||wilco at gcc dot gnu.org Summary|ICE in |[8 regression] ICE in |aarch64_classify_address, |aarch64_classify_address, |at |at

[Bug middle-end/89037] checking ice emitting 128-bit bit-field initializer

2019-02-19 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=89037 Wilco changed: What|Removed |Added CC||wilco at gcc dot gnu.org Version|9.0

[Bug libfortran/78314] [aarch64] ieee_support_halting does not report unsupported fpu traps correctly

2019-02-19 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=78314 Wilco changed: What|Removed |Added CC||wilco at gcc dot gnu.org --- Comment #23 from

[Bug libfortran/78314] [aarch64] ieee_support_halting does not report unsupported fpu traps correctly

2019-02-20 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=78314 --- Comment #25 from Wilco --- (In reply to Steve Ellcey from comment #24) > See email strings at: > > https://gcc.gnu.org/ml/fortran/2019-01/msg00276.html > https://gcc.gnu.org/ml/fortran/2019-02/msg00057.html > > For more discussion. Sure, i

[Bug tree-optimization/89437] New: [9 regression] incorrect result for sinl (atanl (x))

2019-02-21 Thread wilco at gcc dot gnu.org
Component: tree-optimization Assignee: unassigned at gcc dot gnu.org Reporter: wilco at gcc dot gnu.org Target Milestone: --- A recently added optimization uses an inline expansion for sinl (atanl (x)). As it involves computing sqrtl (x * x + 1) which can overflow for large x, there

[Bug tree-optimization/86829] Missing sin(atan(x)) and cos(atan(x)) optimizations

2019-02-21 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=86829 Wilco changed: What|Removed |Added CC||wilco at gcc dot gnu.org --- Comment #8 from

[Bug tree-optimization/89437] [9 regression] incorrect result for sinl (atanl (x))

2019-03-04 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=89437 --- Comment #1 from Wilco --- Author: wilco Date: Mon Mar 4 12:36:04 2019 New Revision: 269364 URL: https://gcc.gnu.org/viewcvs?rev=269364&root=gcc&view=rev Log: Fix PR89437 Fix PR89437. Fix the sinatan-1.c testcase to not run without a C99 ta

[Bug tree-optimization/89437] [9 regression] incorrect result for sinl (atanl (x))

2019-03-04 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=89437 Wilco changed: What|Removed |Added Status|UNCONFIRMED |RESOLVED Resolution|---

[Bug target/89222] [7/8/9 regression] ARM thumb-2 misoptimisation of func ptr call with -O2 or -Os

2019-03-05 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=89222 --- Comment #9 from Wilco --- Author: wilco Date: Tue Mar 5 15:04:01 2019 New Revision: 269390 URL: https://gcc.gnu.org/viewcvs?rev=269390&root=gcc&view=rev Log: [ARM] Fix PR89222 The GCC optimizer can generate symbols with non-zero offset fro

[Bug target/89222] [7/8 regression] ARM thumb-2 misoptimisation of func ptr call with -O2 or -Os

2019-03-05 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=89222 Wilco changed: What|Removed |Added Target Milestone|--- |8.5 Summary|[7/8/9 regression] ARM

[Bug target/89752] [8/9 Regression] ICE in emit_move_insn, at expr.c:3723

2019-03-18 Thread wilco at gcc dot gnu.org
||2019-03-18 CC||wilco at gcc dot gnu.org Ever confirmed|0 |1 --- Comment #2 from Wilco --- Confirmed. It ICEs in Eigen::internal::gebp_kernel, 2, 4, false, false>::operator() It seems to choke on this asm dur

[Bug target/89752] [8/9 Regression] ICE in emit_move_insn, at expr.c:3723

2019-03-18 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=89752 --- Comment #3 from Wilco --- Full instruction: (insn 531 530 532 19 (parallel [ (set (mem/c:BLK (reg:DI 3842) [29 A0+0 S2 A64]) (asm_operands:BLK ("") ("=rm") 0 [ (mem/c:BLK (reg:DI 3846) [29

[Bug target/89752] [8/9 Regression] ICE in emit_move_insn, at expr.c:3723

2019-03-18 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=89752 --- Comment #4 from Wilco --- Small example which generates the same ICE on every GCC version: typedef struct { int x, y, z; } X; void f(void) { X A0, A1; __asm__ ("" : [a0] "+rm" (A0),[a1] "+rm" (A1)); } So it's completely invalid inline

[Bug target/89752] [8/9 Regression] ICE in emit_move_insn, at expr.c:3723

2019-03-18 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=89752 --- Comment #10 from Wilco --- It seems that rewriting "+rm" into "=rm" and "0" is not equivalent. Eg. __asm__ ("" : [a0] "=m" (A0) : "0" (A0)); gives a million warnings "matching constraint does not allow a register", so "0" appears to imply

[Bug ada/89493] [9 Regression] Stack smashing on armv7hl

2019-03-19 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=89493 Wilco changed: What|Removed |Added CC||wilco at gcc dot gnu.org --- Comment #2 from

[Bug target/89607] Missing optimization for store of multiple registers on aarch64

2019-03-19 Thread wilco at gcc dot gnu.org
||wilco at gcc dot gnu.org Resolution|--- |FIXED Target Milestone|--- |9.0 --- Comment #9 from Wilco --- Fixed in GCC9 already, so closing.

[Bug target/88834] [SVE] Poor addressing mode choices for LD2 and ST2

2019-03-28 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=88834 Wilco changed: What|Removed |Added CC||wilco at gcc dot gnu.org --- Comment #11 from

[Bug target/88834] [SVE] Poor addressing mode choices for LD2 and ST2

2019-04-09 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=88834 --- Comment #16 from Wilco --- (In reply to kugan from comment #15) > (In reply to Wilco from comment #11) > > There is also something odd with the way the loop iterates, this doesn't > > look right: > > > > whilelo p0.s, x3, x4 > >

[Bug target/81800] [8/9 regression] on aarch64 ilp32 lrint should not be inlined as two instructions

2019-04-11 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=81800 --- Comment #14 from Wilco --- (In reply to Jakub Jelinek from comment #13) > Patches should be pinged after a week if they aren't reviewed, furthermore, > it is better to CC explicitly relevant maintainers. I've got about 10 patches waiting, I'

[Bug target/81800] [8/9 regression] on aarch64 ilp32 lrint should not be inlined as two instructions

2019-04-11 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=81800 --- Comment #16 from Wilco --- (In reply to Jakub Jelinek from comment #15) > (In reply to Wilco from comment #14) > > (In reply to Jakub Jelinek from comment #13) > > > Patches should be pinged after a week if they aren't reviewed, > > > furthe

[Bug rtl-optimization/87871] [9 Regression] testcases fail after r265398 on arm

2019-04-12 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=87871 --- Comment #19 from Wilco --- (In reply to Peter Bergner from comment #18) > (In reply to Segher Boessenkool from comment #15) > > Popping a5(r116,l0) -- assign reg 3 > > Popping a3(r112,l0) -- assign reg 4 > > Popping a2(r11

[Bug rtl-optimization/87871] [9 Regression] testcases fail after r265398 on arm

2019-04-12 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=87871 --- Comment #21 from Wilco --- (In reply to Vladimir Makarov from comment #20) > (In reply to Wilco from comment #19) > > (In reply to Peter Bergner from comment #18) > > > (In reply to Segher Boessenkool from comment #15) > > > > Popping a

[Bug rtl-optimization/87763] [9 Regression] aarch64 target testcases fail after r265398

2019-04-14 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=87763 --- Comment #52 from Wilco --- (In reply to Jeffrey A. Law from comment #49) > I think the insv_1 (and it's closely related insv_2) regressions can be > fixed by a single ior/and pattern in the backend or by hacking up combine a > bit. I'm still

[Bug rtl-optimization/87763] [9 Regression] aarch64 target testcases fail after r265398

2019-04-17 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=87763 --- Comment #54 from Wilco --- (In reply to Jeffrey A. Law from comment #53) > Realistically the register allocation issues are not going to get addressed > this cycle nor are improvements to the overall handling of RMW insns in > combine. So we

[Bug rtl-optimization/87871] [9 Regression] testcases fail after r265398 on arm

2019-04-18 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=87871 --- Comment #38 from Wilco --- (In reply to Segher Boessenkool from comment #37) > Yes, it is a balancing act. Which option works better? Well the question really is what is bad about movsi_compare0 that could be easily fixed? The move is for

[Bug rtl-optimization/87871] [9 Regression] testcases fail after r265398 on arm

2019-04-18 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=87871 --- Comment #47 from Wilco --- (In reply to Segher Boessenkool from comment #46) > With all three patches together (Peter's, mine, Jakub's), I get a code size > increase of only 0.047%, much more acceptable. Now looking what that diff > really *

[Bug rtl-optimization/87871] [9 Regression] testcases fail after r265398 on arm

2019-04-18 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=87871 --- Comment #52 from Wilco --- (In reply to Segher Boessenkool from comment #48) > With just Peter's and Jakub's patch, it *improves* code size by 0.090%. > That does not fix this PR though :-/ But it does fix most of the codesize regression. Th

[Bug middle-end/90262] New: Inline small constant memmoves

2019-04-26 Thread wilco at gcc dot gnu.org
Assignee: unassigned at gcc dot gnu.org Reporter: wilco at gcc dot gnu.org Target Milestone: --- GCC does not inline fixed-size memmoves. However memmove can be as easily inlined as memcpy. The existing memcpy infrastructure could be reused/expanded for this - all loads would be

[Bug middle-end/90263] New: Calls to mempcpy should use memcpy

2019-04-26 Thread wilco at gcc dot gnu.org
Assignee: unassigned at gcc dot gnu.org Reporter: wilco at gcc dot gnu.org Target Milestone: --- While GCC now inlines fixed-size mempcpy like memcpy, GCC still emits calls to mempcpy rather than converting to memcpy. Since most libraries, including GLIBC, do not have optimized

[Bug middle-end/90263] Calls to mempcpy should use memcpy

2019-04-26 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=90263 --- Comment #2 from Wilco --- (In reply to Jakub Jelinek from comment #1) > As stated several times in the past, I strongly disagree. Why? GCC already does this for bzero and bcopy.

<    1   2   3   4   5   6   7   8   >