[Bug tree-optimization/88398] vectorization failure for a small loop to do byte comparison

2019-01-08 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=88398 --- Comment #17 from Wilco --- (In reply to rguent...@suse.de from comment #16) > But you can't elide the checks in the peeled copies and for 4-times > unrolling you have most cases exiting on the first or fourth check. See comment #8 for an ex

[Bug tree-optimization/88398] vectorization failure for a small loop to do byte comparison

2019-01-08 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=88398 --- Comment #19 from Wilco --- (In reply to Jakub Jelinek from comment #18) > The duffs device doesn't need to be done with computed jump, it can be done > with 3 conditional branches + 3 comparisons too. The advantage of doing > that is especia

[Bug tree-optimization/88398] vectorization failure for a small loop to do byte comparison

2019-01-08 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=88398 --- Comment #20 from Wilco --- I see Kyrill added some examples that show LLVM knows how to unroll loops: https://gcc.gnu.org/bugzilla/show_bug.cgi?id=88760 This kind of thing is much worse than a trailing loop, both for branch prediction and co

[Bug tree-optimization/88739] [7/8/9 Regression] Big-endian union bug

2019-01-08 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=88739 --- Comment #19 from Wilco --- (In reply to Segher Boessenkool from comment #18) > Well, it is always possible to generate code with the opposite endianness to > what the hardware "wants". It just won't be very fast code. > > BITS_BIG_ENDIAN is

[Bug tree-optimization/88739] [7/8/9 Regression] Big-endian union bug

2019-01-08 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=88739 --- Comment #21 from Wilco --- (In reply to Eric Botcazou from comment #20) > > BITS_BIG_ENDIAN is just a convenience to the target code writer. The other > > four do matter, and are quite obvious really (and all four are necessary). > > Yes, I

[Bug tree-optimization/88739] [7/8/9 Regression] Big-endian union bug

2019-01-09 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=88739 --- Comment #25 from Wilco --- (In reply to rguent...@suse.de from comment #17) > On Tue, 8 Jan 2019, wilco at gcc dot gnu.org wrote: > > > https://gcc.gnu.org/bugzilla/show_bug.cgi?id=88739 > > > > --- Comment #16 fro

[Bug tree-optimization/88739] [7/8/9 Regression] Big-endian union bug

2019-01-09 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=88739 --- Comment #27 from Wilco --- (In reply to Eric Botcazou from comment #22) > > Is it really pure RTL, therefore not used in tree? So the above patch using > > BITS_BIG_ENDIAN for tree stuff would be incorrect to use it? > > I wouldn't say incor

[Bug tree-optimization/88739] [7/8/9 Regression] Big-endian union bug

2019-01-09 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=88739 --- Comment #29 from Wilco --- (In reply to Richard Biener from comment #26) > Did anybody test the patch? Testing on x86_64 will be quite pointless... Well that generates _18 = BIT_FIELD_REF <_2, 16, 14>; and becomes: ubfxx1, x20, 2, 16

[Bug tree-optimization/88760] GCC unrolling is suboptimal

2019-01-09 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=88760 Wilco changed: What|Removed |Added CC||wilco at gcc dot gnu.org --- Comment #4 from

[Bug tree-optimization/88760] GCC unrolling is suboptimal

2019-01-09 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=88760 --- Comment #5 from Wilco --- (In reply to Wilco from comment #4) > (In reply to ktkachov from comment #2) > > Created attachment 45386 [details] > > aarch64-llvm output with -Ofast -mcpu=cortex-a57 > > > > I'm attaching the full LLVM aarch64 ou

[Bug tree-optimization/88760] GCC unrolling is suboptimal

2019-01-09 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=88760 --- Comment #7 from Wilco --- (In reply to rguent...@suse.de from comment #6) > On Wed, 9 Jan 2019, wilco at gcc dot gnu.org wrote: > > > https://gcc.gnu.org/bugzilla/show_bug.cgi?id=88760 > > > > --- Comment #5 from

[Bug tree-optimization/88739] [7/8/9 Regression] Big-endian union bug

2019-01-09 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=88739 --- Comment #33 from Wilco --- (In reply to Richard Biener from comment #32) > > > > Index: gcc/expr.c > > === > > --- gcc/expr.c (revision 267553) > > +++ gcc/expr.c (working co

[Bug tree-optimization/88739] [7/8/9 Regression] Big-endian union bug

2019-01-09 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=88739 --- Comment #34 from Wilco --- With just the expr.c patch the gcc regression tests all pass on big-endian AArch64. Interestingly this includes the new torture test, ie. it does not trigger the union bug.

[Bug tree-optimization/88739] [7/8/9 Regression] Big-endian union bug

2019-01-09 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=88739 --- Comment #37 from Wilco --- (In reply to rsand...@gcc.gnu.org from comment #35) > Yeah, the expr.c patch makes the original testcase work, but we still fail > for: That's the folding in ccp1 after inlining, which will require a similar fix. T

[Bug tree-optimization/88760] GCC unrolling is suboptimal

2019-01-17 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=88760 --- Comment #14 from Wilco --- (In reply to rguent...@suse.de from comment #13) > Usually the peeling is done to improve branch prediction on the > prologue/epilogue. Modern branch predictors do much better on a loop than with this kind of code

[Bug tree-optimization/88760] GCC unrolling is suboptimal

2019-01-17 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=88760 --- Comment #16 from Wilco --- (In reply to rguent...@suse.de from comment #15) > which is what I refered to for branch prediction. Your & prompts me > to a way to do sth similar as duffs device, turning the loop into a nest. > > head: >i

[Bug middle-end/88560] [9 Regression] armv8_2-fp16-move-1.c and related regressions after r266385

2019-01-18 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=88560 --- Comment #6 from Wilco --- (In reply to Vladimir Makarov from comment #5) > We have too many tests checking expected generated code. We should more > focus on overall effect of the change. SPEC would be a good criterium > although it is hard

[Bug rtl-optimization/87763] [9 Regression] aarch64 target testcases fail after r265398

2019-01-21 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=87763 --- Comment #11 from Wilco --- A SPEC2006 run shows the codesize cost of make_more_copies is 0.05%. Practically all tests are worse, the largest increases are perlbench at 0.20%, gromacs 0.12%, calculix 0.12%, soplex 0.08%, xalancbmk 0.07%, wrf 0

[Bug rtl-optimization/87763] [9 Regression] aarch64 target testcases fail after r265398

2019-01-21 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=87763 --- Comment #13 from Wilco --- (In reply to Segher Boessenkool from comment #12) > Before the change combine forwarded all argument (etc.) hard registers > wherever > it could, doing part of RA's job (and doing a lousy job of it). If after the >

[Bug rtl-optimization/87763] [9 Regression] aarch64 target testcases fail after r265398

2019-01-21 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=87763 --- Comment #17 from Wilco --- (In reply to Vladimir Makarov from comment #14) > I've checked cvtf_1.c generated code and I don't see additional fmov > anymore. I guess it was fixed by an ira-costs.c change (a special > consideration of moves

[Bug rtl-optimization/87763] [9 Regression] aarch64 target testcases fail after r265398

2019-01-21 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=87763 --- Comment #19 from Wilco --- (In reply to Segher Boessenkool from comment #18) > https://gcc.gnu.org/ml/gcc/2019-01/msg00112.html Thanks, I hadn't noticed that yet... I need to look at it in more detail, but are you saying that combine no long

[Bug rtl-optimization/87763] [9 Regression] aarch64 target testcases fail after r265398

2019-01-22 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=87763 --- Comment #22 from Wilco --- (In reply to Steve Ellcey from comment #21) > If I look at this specific example: > > int f2 (int x, int y) > { > return (x & ~0x0ff000) | ((y & 0x0ff) << 12); > } > > Is this because of x0 (a hard register) at

[Bug rtl-optimization/87763] [9 Regression] aarch64 target testcases fail after r265398

2019-01-22 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=87763 --- Comment #23 from Wilco --- Author: wilco Date: Tue Jan 22 17:49:46 2019 New Revision: 268159 URL: https://gcc.gnu.org/viewcvs?rev=268159&root=gcc&view=rev Log: Fix vect-nop-move.c test Fix a failing test - changes in Combine mean the test n

[Bug tree-optimization/88760] GCC unrolling is suboptimal

2019-01-24 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=88760 --- Comment #19 from Wilco --- (In reply to rguent...@suse.de from comment #18) > > 1) Unrolling for load-pair-forming vectorisation (Richard Sandiford's > > suggestion) > > If that helps, sure (I'd have guessed uarchs are going to split > load

[Bug tree-optimization/88760] GCC unrolling is suboptimal

2019-01-24 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=88760 --- Comment #21 from Wilco --- (In reply to rguent...@suse.de from comment #20) > On Thu, 24 Jan 2019, wilco at gcc dot gnu.org wrote: > > > https://gcc.gnu.org/bugzilla/show_bug.cgi?id=88760 > > > > --- Comment #19

[Bug rtl-optimization/87763] [9 Regression] aarch64 target testcases fail after r265398

2019-01-25 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=87763 --- Comment #32 from Wilco --- Author: wilco Date: Fri Jan 25 13:29:06 2019 New Revision: 268265 URL: https://gcc.gnu.org/viewcvs?rev=268265&root=gcc&view=rev Log: [PATCH][AArch64] Fix generation of tst (PR87763) The TST instruction no longer m

[Bug tree-optimization/88760] GCC unrolling is suboptimal

2019-01-25 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=88760 --- Comment #23 from Wilco --- (In reply to ktkachov from comment #22) > helps even more. On Cortex-A72 it gives a bit more than 6% (vs 3%) > improvement on parest, and about 5.3% on a more aggressive CPU. > I tried unrolling 8x in a similar man

[Bug target/89101] [Aarch64] vfmaq_laneq_f32 generates unnecessary dup instrcutions

2019-01-29 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=89101 Wilco changed: What|Removed |Added CC||wilco at gcc dot gnu.org --- Comment #1 from

[Bug target/89101] [Aarch64] vfmaq_laneq_f32 generates unnecessary dup instrcutions

2019-01-29 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=89101 Wilco changed: What|Removed |Added Status|WAITING |NEW Known to work|

[Bug ipa/89104] ICE: Segmentation fault (in tree_int_cst_elt_check)

2019-01-30 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=89104 Wilco changed: What|Removed |Added CC||wilco at gcc dot gnu.org --- Comment #3 from

[Bug ipa/89104] ICE: Segmentation fault (in tree_int_cst_elt_check)

2019-01-31 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=89104 --- Comment #5 from Wilco --- (In reply to Jakub Jelinek from comment #4) > I really don't like these aarch64 warnings, declare simd is an optimization > (admittedly with ABI consequences) and warning about this by default is > weird, > + it is g

[Bug target/89190] New: [8/9 regression][ARM] armv8-m.base invalid ldm ICE

2019-02-04 Thread wilco at gcc dot gnu.org
: target Assignee: unassigned at gcc dot gnu.org Reporter: wilco at gcc dot gnu.org Target Milestone: --- The following testcases ICEs with -march=armv8-m.base on arm.none.eabi: long long a; int b, c; int d(int e, int f) { return e << f; } void g() { long long h;

[Bug target/89190] [8/9 regression][ARM] armv8-m.base invalid ldm ICE

2019-02-04 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=89190 Wilco changed: What|Removed |Added Status|UNCONFIRMED |ASSIGNED Last reconfirmed|

[Bug rtl-optimization/89195] New: [7/8/9 regression] Corrupted stack offset after combine

2019-02-04 Thread wilco at gcc dot gnu.org
Priority: P3 Component: rtl-optimization Assignee: unassigned at gcc dot gnu.org Reporter: wilco at gcc dot gnu.org Target Milestone: --- The following testcase generates incorrect stack offsets on AArch64 since GCC7 when compiled with -O1 -mbig-endian: struct S

[Bug rtl-optimization/89195] [7/8/9 regression] Corrupted stack offset after combine

2019-02-04 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=89195 Wilco changed: What|Removed |Added Target||aarch64 Target Milestone|---

[Bug rtl-optimization/89195] [7/8/9 regression] Corrupted stack offset after combine

2019-02-04 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=89195 --- Comment #1 from Wilco --- make_extraction does: if (MEM_P (inner)) { poly_int64 offset; /* POS counts from lsb, but make OFFSET count in memory order. */ if (BYTES_BIG_ENDIAN) offset

[Bug rtl-optimization/89195] [7/8/9 regression] Corrupted stack offset after combine

2019-02-05 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=89195 --- Comment #4 from Wilco --- (In reply to Segher Boessenkool from comment #3) > (In reply to Wilco from comment #1) > > len is unsigned HOST_WIDE_INT, so bits_to_bytes_round_down does an unsigned > > division... > > That shouldn't make a differ

[Bug rtl-optimization/89195] [7/8/9 regression] Corrupted stack offset after combine

2019-02-05 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=89195 --- Comment #10 from Wilco --- (In reply to Jakub Jelinek from comment #8) > Created attachment 45606 [details] > gcc9-pr89195.patch > > Now in patch form (untested so far). That works fine indeed. It avoids accessing the object out of bounds b

[Bug rtl-optimization/89195] [7/8/9 regression] Corrupted stack offset after combine

2019-02-05 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=89195 --- Comment #11 from Wilco --- (In reply to Segher Boessenkool from comment #9) > That patch is pre-approved if it regchecks fine (on more than just x86). > Thanks! check-gcc is clean on aarch64_be-none-elf

[Bug rtl-optimization/87871] [9 Regression] testcases fail after r265398 on arm

2019-02-06 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=87871 Wilco changed: What|Removed |Added CC||wilco at gcc dot gnu.org --- Comment #8 from

[Bug target/85434] Address of stack protector guard spilled to stack on ARM

2018-04-17 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=85434 Wilco changed: What|Removed |Added CC||wilco at gcc dot gnu.org --- Comment #4 from

[Bug target/85666] gcc-8.0.1 fails to build mmix target: gcc/libgcc/libgcc2.h:203:20: internal compiler error: in leaf_function_p, at final.c:4488

2018-05-08 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=85666 Wilco changed: What|Removed |Added CC||wilco at gcc dot gnu.org --- Comment #3 from

[Bug tree-optimization/70291] muldc3 code generation could be smarter

2018-05-09 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=70291 --- Comment #6 from Wilco --- (In reply to ktkachov from comment #4) > Implemented for GCC 9. Since multiple people seem interested in the improvement, would it be useful to backport?

[Bug target/85769] [8/9 Regression] ICE in extract_constrain_insn, at recog.c:2205 for -mcpu=thunderx

2018-05-15 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=85769 Wilco changed: What|Removed |Added CC||wilco at gcc dot gnu.org --- Comment #3 from

[Bug target/85666] gcc-8.0.1 fails to build mmix target: gcc/libgcc/libgcc2.h:203:20: internal compiler error: in leaf_function_p, at final.c:4488

2018-05-21 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=85666 --- Comment #8 from Wilco --- (In reply to Hans-Peter Nilsson from comment #7) > Thank you for your interest in the MMIX port. > > (In reply to Wilco from comment #3) > > (In reply to Sergei Trofimovich from comment #1) > > > > > #define MMIX_

[Bug fortran/84487] [8/9 Regression] Large rodate section increase in 465.tonto with r254427

2018-05-23 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=84487 Wilco changed: What|Removed |Added CC||wilco at gcc dot gnu.org --- Comment #7 from

[Bug libstdc++/86168] New: [9 regression] libstdc++ tests fail with error: call of overloaded 'random_device()' is ambiguous

2018-06-15 Thread wilco at gcc dot gnu.org
NCONFIRMED Severity: normal Priority: P3 Component: libstdc++ Assignee: unassigned at gcc dot gnu.org Reporter: wilco at gcc dot gnu.org Target Milestone: --- New failures: FAIL: 25_algorithms/make_heap/complexity.cc (test for excess errors) FAIL: 25_

[Bug tree-optimization/64946] [AArch64] gcc.target/aarch64/vect-abs-compile.c - "abs" vectorization fails for char/short types

2018-06-18 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=64946 --- Comment #26 from Wilco --- Author: wilco Date: Mon Jun 18 11:10:51 2018 New Revision: 261698 URL: https://gcc.gnu.org/viewcvs?rev=261698&root=gcc&view=rev Log: [testsuite] Remove xfail from vect-abs-compile.c Since PR64946 has been fixed, w

[Bug tree-optimization/64946] [AArch64] gcc.target/aarch64/vect-abs-compile.c - "abs" vectorization fails for char/short types

2018-06-18 Thread wilco at gcc dot gnu.org
||wilco at gcc dot gnu.org Resolution|--- |FIXED --- Comment #27 from Wilco --- Fixed and test updated.

[Bug tree-optimization/86076] [7/8 Regression] ICE: verify_gimple failed (error: location references block not in block tree)

2018-06-18 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=86076 --- Comment #7 from Wilco --- Author: wilco Date: Mon Jun 18 12:17:10 2018 New Revision: 261699 URL: https://gcc.gnu.org/viewcvs?rev=261699&root=gcc&view=rev Log: [testsuite] Add target pthread to pr86076.c Add missing target pthread to ensure

[Bug libstdc++/86333] New: [9 regression] libstdc++ basic_string fails to link

2018-06-27 Thread wilco at gcc dot gnu.org
: libstdc++ Assignee: unassigned at gcc dot gnu.org Reporter: wilco at gcc dot gnu.org Target Milestone: --- Recently these tests started to fail on arm-none-linux-gnueabihf: FAIL: 21_strings/basic_string/modifiers/64422.cc (test for excess errors) FAIL: 21_strings

[Bug libstdc++/86335] New: [9 regression] libstdc++ memory_resource gives error on 32-bit targets

2018-06-27 Thread wilco at gcc dot gnu.org
Priority: P3 Component: libstdc++ Assignee: unassigned at gcc dot gnu.org Reporter: wilco at gcc dot gnu.org Target Milestone: --- The following warning happens when you use memory_resource: ...libstdc++-v3/include/experimental/memory_resource:329: warning

[Bug middle-end/86336] New: [9 regression] ICE in omp-low.c:7879

2018-06-27 Thread wilco at gcc dot gnu.org
Assignee: unassigned at gcc dot gnu.org Reporter: wilco at gcc dot gnu.org Target Milestone: --- Recently omp-low.c started to show an ICE: UNRESOLVED: libgomp.oacc-c++/non-scalar-data.C -DACC_DEVICE_TYPE_host=1 -DACC_MEM_SHARED=1 -O2 compilation failed to produce executable

[Bug fortran/86337] New: [9 regression] gfortran segmentation fault in resolve.c:3119

2018-06-27 Thread wilco at gcc dot gnu.org
Priority: P3 Component: fortran Assignee: unassigned at gcc dot gnu.org Reporter: wilco at gcc dot gnu.org Target Milestone: --- Recently the gfortran.dg/typebound_proc_35.f90 -Os test fails as follows: f951: internal compiler error: Segmentation fault 0xc58e8f

[Bug fortran/86337] [9 regression] gfortran segmentation fault in resolve.c:3119

2018-06-28 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=86337 Wilco changed: What|Removed |Added Keywords||ice-on-valid-code Status|UNCONFIRMED

[Bug libstdc++/86333] [9 regression] libstdc++ basic_string fails to link

2018-06-28 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=86333 --- Comment #3 from Wilco --- (In reply to Jonathan Wakely from comment #2) > These are the same failures as clyon reported for arm > https://gcc.gnu.org/ml/libstdc++/2018-06/msg00130.html > > So I'm going to assume this report is based on an ol

[Bug target/85711] [8 regression] ICE in aarch64_classify_address, at config/aarch64/aarch64.c:5678

2019-07-15 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=85711 --- Comment #5 from Wilco --- Author: wilco Date: Mon Jul 15 11:00:48 2019 New Revision: 273491 URL: https://gcc.gnu.org/viewcvs?rev=273491&root=gcc&view=rev Log: Backport fix for PR85711 Backport from mainline 2019-01-23 Bin C

[Bug target/85711] ICE in aarch64_classify_address, at config/aarch64/aarch64.c:5678

2019-07-15 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=85711 Wilco changed: What|Removed |Added Status|NEW |RESOLVED Resolution|---

[Bug bootstrap/91176] New: [10 regression] AArch64 bootstrap fails since r273479

2019-07-15 Thread wilco at gcc dot gnu.org
Component: bootstrap Assignee: unassigned at gcc dot gnu.org Reporter: wilco at gcc dot gnu.org Target Milestone: --- Since r273479 bootstrap fails with this error on AArch64: Comparing stages 2 and 3 Bootstrap comparison failure! gcc/aarch64.o differs gcc/dwarf2out.o differs gcc

[Bug bootstrap/91176] [10 regression] AArch64 bootstrap fails since r273479

2019-07-15 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=91176 --- Comment #2 from Wilco --- (In reply to rsand...@gcc.gnu.org from comment #1) > Dramantic differences in file size are expected, since stage2 is built with > -gtoggle (to suppress debug info) whereas stage 3 is built normally. One of > the th

[Bug tree-optimization/83518] [8/9 Regression] Missing optimization: useless instructions should be dropped

2019-07-16 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=83518 Wilco changed: What|Removed |Added Status|RESOLVED|REOPENED Resolution|FIXED

[Bug target/89190] [8 regression][ARM] armv8-m.base invalid ldm ICE

2019-07-16 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=89190 --- Comment #4 from Wilco --- Author: wilco Date: Tue Jul 16 12:00:42 2019 New Revision: 273523 URL: https://gcc.gnu.org/viewcvs?rev=273523&root=gcc&view=rev Log: [ARM] Fix Thumb-1 ldm (PR89190) This patch fixes an ICE in the Thumb-1 LDM peepho

[Bug target/89190] [8 regression][ARM] armv8-m.base invalid ldm ICE

2019-07-16 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=89190 --- Comment #5 from Wilco --- Author: wilco Date: Tue Jul 16 12:06:55 2019 New Revision: 273524 URL: https://gcc.gnu.org/viewcvs?rev=273524&root=gcc&view=rev Log: Add missing testcase for PR89190 testsuite/ PR target/89190 *

[Bug target/89190] [8 regression][ARM] armv8-m.base invalid ldm ICE

2019-07-16 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=89190 --- Comment #6 from Wilco --- Author: wilco Date: Tue Jul 16 12:12:01 2019 New Revision: 273525 URL: https://gcc.gnu.org/viewcvs?rev=273525&root=gcc&view=rev Log: Add missing testcase for PR89190 testsuite/ PR target/89190 *

[Bug target/89190] [8 regression][ARM] armv8-m.base invalid ldm ICE

2019-07-16 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=89190 --- Comment #7 from Wilco --- Backported to GCC8 branch, and added missing testcase to trunk and GCC9. All fixed.

[Bug target/89190] [ARM] armv8-m.base invalid ldm ICE

2019-07-16 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=89190 Wilco changed: What|Removed |Added Status|ASSIGNED|RESOLVED Resolution|---

[Bug target/89222] [7/8 regression] ARM thumb-2 misoptimisation of func ptr call with -O2 or -Os

2019-07-16 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=89222 --- Comment #10 from Wilco --- Author: wilco Date: Tue Jul 16 12:37:14 2019 New Revision: 273526 URL: https://gcc.gnu.org/viewcvs?rev=273526&root=gcc&view=rev Log: [ARM] Fix PR89222 The GCC optimizer can generate symbols with non-zero offset fr

[Bug target/89222] [7 regression] ARM thumb-2 misoptimisation of func ptr call with -O2 or -Os

2019-07-16 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=89222 --- Comment #11 from Wilco --- Author: wilco Date: Tue Jul 16 13:13:26 2019 New Revision: 273527 URL: https://gcc.gnu.org/viewcvs?rev=273527&root=gcc&view=rev Log: [ARM] Fix PR89222 The GCC optimizer can generate symbols with non-zero offset fr

[Bug target/89222] [7 regression] ARM thumb-2 misoptimisation of func ptr call with -O2 or -Os

2019-07-16 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=89222 Wilco changed: What|Removed |Added Status|ASSIGNED|RESOLVED Resolution|---

[Bug target/89222] [7 regression] ARM thumb-2 misoptimisation of func ptr call with -O2 or -Os

2019-07-16 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=89222 Wilco changed: What|Removed |Added Target Milestone|8.5 |7.5

[Bug driver/90684] New alignment options incorrectly report error

2019-07-16 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=90684 --- Comment #4 from Wilco --- Author: wilco Date: Tue Jul 16 16:17:39 2019 New Revision: 273533 URL: https://gcc.gnu.org/viewcvs?rev=273533&root=gcc&view=rev Log: Fix alignment option parser (PR90684) Fix the alignment option parser to always a

[Bug driver/90684] New alignment options incorrectly report error

2019-07-16 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=90684 Wilco changed: What|Removed |Added Status|ASSIGNED|RESOLVED Resolution|---

[Bug target/90317] [7/8/9] ICE for arm sha1h and wrong optimisations on sha1h/c/m/p

2019-07-18 Thread wilco at gcc dot gnu.org
dot gnu.org |wilco at gcc dot gnu.org Summary|[7/8/9/10] ICE for arm |[7/8/9] ICE for arm sha1h |sha1h and wrong |and wrong optimisations on |optimisations on|sha1h/c/m/p |sha1h/c/m/p

[Bug tree-optimization/90883] Generated code is worse if returned struct is unnamed

2019-07-23 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=90883 --- Comment #17 from Wilco --- (In reply to Jeffrey A. Law from comment #16) > The issue here (of course) is that aarch64 has a different set of defaults > for when to open-code vs loop vs function call. My attempts to pick a > better size for

[Bug tree-optimization/91144] [10 regression] 176.gcc miscompare after r273294

2019-08-01 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=91144 Wilco changed: What|Removed |Added Status|UNCONFIRMED |NEW Last reconfirmed|

[Bug tree-optimization/91322] New: [10 regression] alias-4 test failure

2019-08-01 Thread wilco at gcc dot gnu.org
-optimization Assignee: unassigned at gcc dot gnu.org Reporter: wilco at gcc dot gnu.org Target Milestone: --- The new alias-4_0.C fails on Arm. Without LTO it also fails with -O2 on AArch64, but it fails with -O2 and -O3 on Arm, so something must be different. Unfortunately even

[Bug target/91386] open-iscsi iscsiadm miscompiled by LTO on aarch64

2019-08-07 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=91386 Wilco changed: What|Removed |Added CC||wilco at gcc dot gnu.org --- Comment #12 from

[Bug target/91386] open-iscsi iscsiadm miscompiled by LTO on aarch64

2019-08-07 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=91386 --- Comment #14 from Wilco --- (In reply to Martin Liška from comment #13) > > > > The key question is how does one dump rtl with -flto? It doesn't work at > > all, making debugging this difficult... > > It does, look: > > marxin@marxinbox:/tm

[Bug tree-optimization/90838] Detect table-based ctz implementation

2019-08-09 Thread wilco at gcc dot gnu.org
||2019-08-09 Assignee|unassigned at gcc dot gnu.org |wilco at gcc dot gnu.org Ever confirmed|0 |1 --- Comment #7 from Wilco --- I'll have a look at this, I think it could easily be done in match.pd if we add support for matching

[Bug middle-end/90693] Missing popcount simplifications

2019-08-09 Thread wilco at gcc dot gnu.org
||2019-08-09 Assignee|unassigned at gcc dot gnu.org |wilco at gcc dot gnu.org Ever confirmed|0 |1

[Bug target/91386] [9 regression] open-iscsi iscsiadm miscompiled by LTO on aarch64

2019-08-12 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=91386 --- Comment #22 from Wilco --- (In reply to Richard Earnshaw from comment #21) > Fixed on trunk. I ran an AArch64 bootstrap on GCC9 branch and that is fine.

[Bug target/81800] [8/9/10 regression] on aarch64 ilp32 lrint should not be inlined as two instructions

2019-08-13 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=81800 --- Comment #17 from Wilco --- Author: wilco Date: Tue Aug 13 10:46:44 2019 New Revision: 274376 URL: https://gcc.gnu.org/viewcvs?rev=274376&root=gcc&view=rev Log: [AArch64] Fix PR81800 PR81800 is about the lrint inline giving spurious FE_INEXA

[Bug target/81800] [8/9 regression] on aarch64 ilp32 lrint should not be inlined as two instructions

2019-08-13 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=81800 Wilco changed: What|Removed |Added Known to work||10.0 Summary|[8/9/10 regression] on

[Bug target/77308] surprisingly large stack usage for sha512 on arm

2019-08-23 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=77308 Wilco changed: What|Removed |Added Status|NEW |RESOLVED Resolution|---

[Bug target/91598] [8/9/10 regression] 60% speed drop on neon intrinsic loop

2019-08-30 Thread wilco at gcc dot gnu.org
|NEW Last reconfirmed||2019-08-30 CC||wilco at gcc dot gnu.org Ever confirmed|0 |1 --- Comment #3 from Wilco --- (In reply to Maxim Kuvyrkov from comment #2) > Created attachm

[Bug tree-optimization/83661] sincos does not handle sin(2x)

2019-09-02 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=83661 Wilco changed: What|Removed |Added CC||wilco at gcc dot gnu.org --- Comment #8 from

[Bug tree-optimization/83661] sincos does not handle sin(2x)

2019-09-02 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=83661 --- Comment #9 from Wilco --- (In reply to Christophe Monat from comment #4) > Hi Pratamesh, > > You're absolutely right - maybe it's more efficient when there is some > hardware sincos available (Intel FSINCOS ?) but I would check also carefull

[Bug target/81800] [8/9 regression] on aarch64 ilp32 lrint should not be inlined as two instructions

2019-09-04 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=81800 --- Comment #19 from Wilco --- Author: wilco Date: Wed Sep 4 12:42:22 2019 New Revision: 275373 URL: https://gcc.gnu.org/viewcvs?rev=275373&root=gcc&view=rev Log: [AArch64] Fix PR81800 PR81800 is about the lrint inline giving spurious FE_INEXA

[Bug target/81800] [8/9 regression] on aarch64 ilp32 lrint should not be inlined as two instructions

2019-09-04 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=81800 --- Comment #20 from Wilco --- Author: wilco Date: Wed Sep 4 13:06:55 2019 New Revision: 275374 URL: https://gcc.gnu.org/viewcvs?rev=275374&root=gcc&view=rev Log: [AArch64] Fix PR81800 PR81800 is about the lrint inline giving spurious FE_INEXA

[Bug target/81800] On aarch64 ilp32 lrint should not be inlined as two instructions

2019-09-04 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=81800 --- Comment #21 from Wilco --- Backported to GCC8 and GCC9 too.

[Bug target/81800] On aarch64 ilp32 lrint should not be inlined as two instructions

2019-09-04 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=81800 Wilco changed: What|Removed |Added Status|REOPENED|RESOLVED Resolution|---

[Bug target/91684] New: [10 regression][ARM] ICE in gen_movdi, at config/arm/arm.md:5079

2019-09-06 Thread wilco at gcc dot gnu.org
Priority: P3 Component: target Assignee: unassigned at gcc dot gnu.org Reporter: wilco at gcc dot gnu.org Target Milestone: --- The following small example fails with an ICE with -mcpu=cortex-a57: typedef struct { int a, b, c; } S; void g (S *s); void bug1 (void

[Bug target/91684] [10 regression][ARM] ICE in gen_movdi, at config/arm/arm.md:5079

2019-09-06 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=91684 Wilco changed: What|Removed |Added Status|UNCONFIRMED |NEW Last reconfirmed|

[Bug target/91708] New: [10 regression][ARM] Bootstrap fails in gen_movsi, at config/arm/arm.md:5258

2019-09-09 Thread wilco at gcc dot gnu.org
: normal Priority: P3 Component: target Assignee: unassigned at gcc dot gnu.org Reporter: wilco at gcc dot gnu.org Target Milestone: --- Arm bootstrap still fails with latest trunk when configured with --with-cpu=cortex-a57: gcc/gcc/real.c:5240:1: internal

[Bug target/91708] [10 regression][ARM] Bootstrap fails in gen_movsi, at config/arm/arm.md:5258

2019-09-09 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=91708 --- Comment #2 from Wilco --- (In reply to Bernd Edlinger from comment #1) > Oh, nice, could you say what config options you use? The full set I used: ${src}/configure --disable-libsanitizer --enable-languages=c,c++ --prefix=${install} --targe

[Bug target/91708] [10 regression][ARM] Bootstrap fails in gen_movsi, at config/arm/arm.md:5258

2019-09-09 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=91708 --- Comment #5 from Wilco --- (In reply to Christophe Lyon from comment #4) > (In reply to Bernd Edlinger from comment #3) > > I will try to reproduce with building of a cross for this target. > > FWIW, cross-build looked fine for me at r275501:

[Bug rtl-optimization/42575] arm-eabi-gcc 64-bit multiply weirdness

2019-09-09 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=42575 --- Comment #20 from Wilco --- (In reply to Wilco from comment #19) > (In reply to Christophe Lyon from comment #18) > > This is still wrong with current trunk. > > I don't see it happening since expansion of DImode instructions improved. > The

[Bug fortran/91690] Slow IEEE intrinsics

2019-09-09 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=91690 Wilco changed: What|Removed |Added CC||wilco at gcc dot gnu.org --- Comment #4 from

[Bug target/91708] [10 regression][ARM] Bootstrap fails in gen_movsi, at config/arm/arm.md:5258

2019-09-09 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=91708 --- Comment #8 from Wilco --- (In reply to Bernd Edlinger from comment #7) > without looking in detail, this could > be another middle-end error or the back-end > generating an invalid instruction where no assertions > are, then lra can rewrite t

[Bug target/91708] [10 regression][ARM] Bootstrap fails in gen_movsi, at config/arm/arm.md:5258

2019-09-10 Thread wilco at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=91708 --- Comment #11 from Wilco --- Created attachment 46862 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=46862&action=edit Reproducer g++ -mcpu=cortex-a57 -mfpu=fp-armv8 out.c -O2 -c reproduces the issue outside of a bootstrap

<    1   2   3   4   5   6   7   8   >