[Bug target/108922] fmod() 13x slowdown in gcc4.9 dropping "fprem" and calling fmod()

2023-02-27 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=108922 --- Comment #17 from Uroš Bizjak --- (In reply to Jakub Jelinek from comment #16) > Doesn't the SDM guarantee the right behavior though? Indeed, this is what is missing from Table 3-31. > It is true that the FPREM results table says * and **

[Bug target/108922] fmod() 13x slowdown in gcc4.9 dropping "fprem" and calling fmod()

2023-02-26 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=108922 --- Comment #14 from Uroš Bizjak --- (In reply to Jan Kratochvil from comment #13) > The question is whether gcc can rely on the undocumented Intel behavior as > described in Comment 7. glibc already relies on it anyway. I don't think this is

[Bug target/108922] fmod() 13x slowdown in gcc4.9 dropping "fprem" and calling fmod()

2023-02-26 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=108922 --- Comment #12 from Uroš Bizjak --- (In reply to Jan Kratochvil from comment #8) > The revert makes it 13x faster. But the produced code still falls back to > calling glibc fmod() as shown in the disassembly in Comment 0. > If I use the

[Bug target/108922] fmod() 13x slowdown in gcc4.9 dropping "fprem" and calling fmod()

2023-02-26 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=108922 --- Comment #11 from Uroš Bizjak --- (In reply to Jan Kratochvil from comment #8) > It is true replacing fmod() with fmodl() makes it 5x faster (but only 5x). > There is still some infinity check and I haven't found any real > justification in

[Bug target/108922] fmod() 13x slowdown in gcc4.9 dropping "fprem" and calling fmod()

2023-02-25 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=108922 --- Comment #6 from Uroš Bizjak --- (In reply to Uroš Bizjak from comment #5) > (In reply to Alexander Monakov from comment #3) > > I guess Uros' claim was based on what Intel and AMD manuals specify rather > > than observed behavior of CPUs. >

[Bug target/108922] fmod() 13x slowdown in gcc4.9 dropping "fprem" and calling fmod()

2023-02-25 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=108922 --- Comment #5 from Uroš Bizjak --- (In reply to Alexander Monakov from comment #3) > I guess Uros' claim was based on what Intel and AMD manuals specify rather > than observed behavior of CPUs. As a "committer", I really don't remember the

[Bug target/104375] [x86] Failure to recognize bzhi pattern when shr is present

2023-02-18 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=104375 Uroš Bizjak changed: What|Removed |Added Status|UNCONFIRMED |RESOLVED Resolution|---

[Bug target/94908] Failure to optimally optimize certain shuffle patterns

2023-02-18 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=94908 Uroš Bizjak changed: What|Removed |Added CC||crazylht at gmail dot com --- Comment #5

[Bug target/108831] QImode binary ops with one zero-extracted argument can be optimized

2023-02-17 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=108831 Uroš Bizjak changed: What|Removed |Added Resolution|--- |FIXED Target Milestone|---

[Bug rtl-optimization/108805] [13 Regression] ICE: in simplify_subreg, at simplify-rtx.cc:7400 at -O and above

2023-02-17 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=108805 Uroš Bizjak changed: What|Removed |Added Status|ASSIGNED|RESOLVED Resolution|---

[Bug rtl-optimization/108805] [13 Regression] ICE: in simplify_subreg, at simplify-rtx.cc:7400 at -O and above

2023-02-17 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=108805 Uroš Bizjak changed: What|Removed |Added CC||avieira at gcc dot gnu.org --- Comment

[Bug target/108832] [13 Regression] ICE in replace_rtx, at rtlanal.cc:3358

2023-02-17 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=108832 --- Comment #4 from Uroš Bizjak --- (In reply to Jakub Jelinek from comment #1) > and so ICEs if we see the same REGNO as from in a different mode. > I think we actually don't need most of what replace_rtx is doing, we don't > need to simplify

[Bug target/108831] QImode binary ops with one zero-extracted argument can be optimized

2023-02-17 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=108831 --- Comment #2 from Uroš Bizjak --- (In reply to Uroš Bizjak from comment #1) > The patch also handles constant memory operands on x86_64. --cut here-- struct S { unsigned char pad1; unsigned char val; unsigned short pad2; }; unsigned

[Bug target/108831] QImode binary ops with one zero-extracted argument can be optimized

2023-02-17 Thread ubizjak at gmail dot com via Gcc-bugs
dot gnu.org |ubizjak at gmail dot com Status|UNCONFIRMED |ASSIGNED Last reconfirmed||2023-02-17 --- Comment #1 from Uroš Bizjak --- Created attachment 54479 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=54479=edit Propo

[Bug target/108831] New: QImode binary ops with one zero-extracted argument can be optimized

2023-02-17 Thread ubizjak at gmail dot com via Gcc-bugs
Priority: P3 Component: target Assignee: unassigned at gcc dot gnu.org Reporter: ubizjak at gmail dot com Target Milestone: --- Following testcase: --cut here-- struct S { unsigned char pad1; unsigned char val; unsigned short pad2; }; unsigned char

[Bug rtl-optimization/108805] [13 Regression] ICE: in simplify_subreg, at simplify-rtx.cc:7400 at -O and above

2023-02-15 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=108805 Uroš Bizjak changed: What|Removed |Added Assignee|unassigned at gcc dot gnu.org |ubizjak at gmail dot com

[Bug rtl-optimization/108805] [13 Regression] ICE: in simplify_subreg, at simplify-rtx.cc:7400 at -O and above

2023-02-15 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=108805 Uroš Bizjak changed: What|Removed |Added Target Milestone|--- |13.0 Status|UNCONFIRMED

[Bug target/108516] [11/12 Regression] Useless movzx instruction emitted when loading 8 bits from 24 bit struct

2023-02-13 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=108516 Uroš Bizjak changed: What|Removed |Added Resolution|--- |FIXED Summary|Useless movzx

[Bug rtl-optimization/108516] Useless movzx instruction emitted when loading 8 bits from 24 bit struct

2023-02-13 Thread ubizjak at gmail dot com via Gcc-bugs
at gcc dot gnu.org |ubizjak at gmail dot com --- Comment #6 from Uroš Bizjak --- Created attachment 54454 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=54454=edit patch to relax extract modes This patch relaxes extract and insert operand modes to no longer match op mode.

[Bug rtl-optimization/104054] [10/11/12/13 Regression] '-fcompare-debug' failure (length) w/ -Os -funroll-loops since r8-3946-gfe94440235cfaa57

2023-02-03 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=104054 --- Comment #11 from Uroš Bizjak --- The testcase now PASSes compare-debug with: gcc version 13.0.1 20230203 (experimental) [master r13-5678-g167b04b9b8a] (GCC) ... but passes due to different register allocation, where regrename is not

[Bug tree-optimization/108552] Linux i386 kernel 5.14 memory corruption for pre_compound_page() when gcov is enabled

2023-01-27 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=108552 --- Comment #22 from Uroš Bizjak --- BTW: It is the reload pass that duplicates read from __gcov0.prep_compound_page[7].

[Bug tree-optimization/108552] Linux i386 kernel 5.14 memory corruption for pre_compound_page() when gcov is enabled

2023-01-27 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=108552 --- Comment #20 from Uroš Bizjak --- (In reply to Uroš Bizjak from comment #19) > __gcov0.prep_compound_page. But as shown in Comment #15, we have two Comment #16, actually.

[Bug tree-optimization/108552] Linux i386 kernel 5.14 memory corruption for pre_compound_page() when gcov is enabled

2023-01-27 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=108552 --- Comment #19 from Uroš Bizjak --- Some further analysis: pretmp_94 = __gcov0.prep_compound_page[7]; <-- _179 = pretmp_94 + 1; <-- ivtmp.1725_211 = (unsigned long long) _179; _135 = (unsigned int) nr_pages_11;

[Bug tree-optimization/108552] Linux i386 kernel 5.14 memory corruption for pre_compound_page() when gcov is enabled

2023-01-27 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=108552 Uroš Bizjak changed: What|Removed |Added Component|target |tree-optimization

[Bug target/108552] Linux i386 kernel 5.14 memory corruption for pre_compound_page() when gcov is enabled

2023-01-27 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=108552 --- Comment #17 from Uroš Bizjak --- The assembly is just mirroring what tree optimizers prepare: pretmp_94 = __gcov0.prep_compound_page[7]; _179 = pretmp_94 + 1; ivtmp.1725_211 = (unsigned long long) _179; ... [local count:

[Bug target/108552] Linux i386 kernel 5.14 memory corruption for pre_compound_page() when gcov is enabled

2023-01-27 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=108552 --- Comment #16 from Uroš Bizjak --- addl$1, __gcov0.prep_compound_page+48 adcl$0, __gcov0.prep_compound_page+52 cmpl$1, %ebx jle .L1470 leal1(%edi), %eax movl

[Bug target/108552] Linux i386 kernel 5.14 memory corruption for pre_compound_page() when gcov is enabled

2023-01-27 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=108552 --- Comment #15 from Uroš Bizjak --- Sorry, %esi/%edi is the correct order. -24(%ebp): some value previously saved to stack frame %ecx: address to write to %eax/%edx: loop iterator %esi/%edi: termination value .L1469: movl%eax,

[Bug target/108552] Linux i386 kernel 5.14 memory corruption for pre_compound_page() when gcov is enabled

2023-01-27 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=108552 --- Comment #14 from Uroš Bizjak --- The loop is actually pretty simple, please see the interpretation below -24(%ebp): some value previously saved to stack frame %ecx: address to write to %eax/%edx: loop iterator %edi/%esi: termination value

[Bug target/108552] Linux i386 kernel 5.14 memory corruption for pre_compound_page() when gcov is enabled

2023-01-27 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=108552 --- Comment #13 from Uroš Bizjak --- -fverbose-asm annotated assembly: prep_compound_page: pushl %ebp# movl%esp, %ebp #, pushl %edi# movl%eax, %edi # tmp356, page pushl

[Bug tree-optimization/108477] New: fwprop over-optimizes conversion from + to |

2023-01-20 Thread ubizjak at gmail dot com via Gcc-bugs
: tree-optimization Assignee: unassigned at gcc dot gnu.org Reporter: ubizjak at gmail dot com Target Milestone: --- In the unsigned int case (baz) fwprop over-optimizes the addition to a logical or: --cut here-- int lock; int bar (int old) { int val = (old >> 1) &

[Bug rtl-optimization/108292] [13 Regression] wrong code with vector compare & mask at -O1 and above

2023-01-05 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=108292 --- Comment #11 from Uroš Bizjak --- (In reply to Jakub Jelinek from comment #10) > (In reply to Roger Sayle from comment #8) > > Here's my proposed patch (or something close to it, it's still bootstrapping > > and regression testing). The

[Bug target/107934] ICE: SIGSEGV in immediate_operand (recog.cc:1618) with -O2 -mtune=knl -ffinite-math-only and __bf16 since r13-4314-ga1ecc5600464f6a6

2022-11-30 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=107934 --- Comment #2 from Uroš Bizjak --- The type of extendbfsf2_1 insn should be sseishft1.

[Bug target/107671] i386: Missed optimization: use of bt in bit test pattern (using -O2 -mtune=core2)

2022-11-15 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=107671 --- Comment #4 from Uroš Bizjak --- Created attachment 53901 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=53901=edit Patch that adds relevant zero_extract patterns This patch adds relevant zero_extract patterns that optimize:

[Bug target/107404] [12/13 Regression] Wrong code with -O3 since r12-6416-g037cc0b4a6646cc8

2022-11-03 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=107404 Uroš Bizjak changed: What|Removed |Added Status|ASSIGNED|RESOLVED Resolution|---

[Bug tree-optimization/103035] [meta-bug] YARPGen bugs

2022-11-03 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=103035 Bug 103035 depends on bug 107404, which changed state. Bug 107404 Summary: [12/13 Regression] Wrong code with -O3 since r12-6416-g037cc0b4a6646cc8 https://gcc.gnu.org/bugzilla/show_bug.cgi?id=107404 What|Removed

[Bug target/107404] [12/13 Regression] Wrong code with -O3 since r12-6416-g037cc0b4a6646cc8

2022-11-03 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=107404 Uroš Bizjak changed: What|Removed |Added Assignee|unassigned at gcc dot gnu.org |ubizjak at gmail dot com

[Bug target/107404] [12/13 Regression] Wrong code with -O3 since r12-6416-g037cc0b4a6646cc8

2022-11-03 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=107404 --- Comment #4 from Uroš Bizjak --- (In reply to rsand...@gcc.gnu.org from comment #3) > This is due to the peephole2 added in r12-2640-gf7bf03cf69cc: > > ;; Eliminate a reg-reg mov by inverting the condition of a cmov (#2). > ;; mov r2,r3;

[Bug rtl-optimization/107057] [10/11/12/13 Regression] ICE in extract_constrain_insn, at recog.cc:2692

2022-10-28 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=107057 --- Comment #9 from Uroš Bizjak --- (In reply to Hongtao.liu from comment #7) > And it looks like the pattern is wrongly defined since from [1]. > > --cut begin > Matching constraints are used in these circumstances. More

[Bug target/107281] comparisations with u/int64_t constants not generate vector-result

2022-10-16 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=107281 --- Comment #2 from Uroš Bizjak --- Try to compile the testcase with -msse4.2.

[Bug target/107259] New: Vectorize VxHF _Float16 modes using partial vectors without -mavx512vl

2022-10-14 Thread ubizjak at gmail dot com via Gcc-bugs
Priority: P3 Component: target Assignee: unassigned at gcc dot gnu.org Reporter: ubizjak at gmail dot com Target Milestone: --- The following testcase: --cut here-- #define N 16 _Float16 r[N], a[N], b[N]; void foo (void) { for (int i = 0; i < N

[Bug target/107172] [13 Regression] wrong code with "-O1 -ftree-vrp" on x86_64-linux-gnu since r13-1268-g8c99e307b20c502e

2022-10-13 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=107172 --- Comment #24 from Uroš Bizjak --- (In reply to Hongtao.liu from comment #23) > looking at i386.c put_condition_code used by *setcc_qi, it looks like (EQ > (reg:CCCmode FLAG_REG) (const_int 0)) means get carry flag. > Not (LTU: (REG:CCCmode

[Bug rtl-optimization/107057] [10/11/12/13 Regression] ICE in extract_constrain_insn, at recog.cc:2692

2022-09-29 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=107057 Uroš Bizjak changed: What|Removed |Added Component|target |rtl-optimization Keywords|

[Bug target/107057] [10/11/12/13 Regression] ICE in extract_constrain_insn, at recog.cc:2692

2022-09-29 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=107057 --- Comment #2 from Uroš Bizjak --- Reload starts with: (insn 76 67 101 5 (set (reg/v:V2DF 108 [ x ]) (vec_concat:V2DF (reg:DF 182) (reg:DF 182))) "pr107057.c":7:10 5952 {vec_concatv2df} (expr_list:REG_EQUAL

[Bug target/106966] alpha cross build crashes gcc-12 "internal compiler error: in emit_move_insn"

2022-09-23 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=106966 Uroš Bizjak changed: What|Removed |Added Ever confirmed|0 |1 Status|UNCONFIRMED

[Bug target/106966] alpha cross build crashes gcc-12 "internal compiler error: in emit_move_insn"

2022-09-20 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=106966 --- Comment #4 from Uroš Bizjak --- (In reply to Christian Ehrhardt from comment #3) > > Just drop -mbuild-constants. > > Thanks for the hint Uroš, but I'm not sure if one can do that, this option > is from [1]. I do not have the background on

[Bug target/106966] alpha cross build crashes gcc-12 "internal compiler error: in emit_move_insn"

2022-09-19 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=106966 --- Comment #2 from Uroš Bizjak --- (In reply to Christian Ehrhardt from comment #0) > alpha-linux-gnu-gcc -O2 -g1 -Wall -fvisibility=hidden -fno-strict-aliasing > -msmall-text -msmall-data -mno-fp-regs -mbuild-constants -mcpu=ev67 Just drop

[Bug target/106707] [13 Regression] ICE: in cselib_record_set, at cselib.cc:2687 with -Oz -g -fno-cprop-registers -fno-dce since r13-1945-gfc6ef90173478521

2022-09-01 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=106707 Uroš Bizjak changed: What|Removed |Added Resolution|--- |FIXED Status|NEW

[Bug target/106707] [13 Regression] ICE: in cselib_record_set, at cselib.cc:2687 with -Oz -g -fno-cprop-registers -fno-dce since r13-1945-gfc6ef90173478521

2022-08-30 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=106707 --- Comment #5 from Uroš Bizjak --- The following pattern is matched by a peephole2 pattern: (insn 164 161 165 5 (set (reg:DI 0 ax [orig:91 _10 ] [91]) (reg:DI 0 ax)) "pr106707.c":13:12 82 {*movdi_internal} (expr_list:REG_UNUSED

[Bug go/85037] SIGSEGV in gotools testsuite affects several tests

2022-08-18 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=85037 Uroš Bizjak changed: What|Removed |Added Resolution|--- |WONTFIX Status|UNCONFIRMED

[Bug d/90261] FAIL: libphobos.phobos/std/file.d on CentOS 5.11, Linux 2.6.18

2022-08-18 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=90261 Uroš Bizjak changed: What|Removed |Added Status|UNCONFIRMED |RESOLVED Resolution|---

[Bug target/101346] ICE: maximum number of generated reload insns per insn achieved (90)

2022-08-18 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=101346 Uroš Bizjak changed: What|Removed |Added Target Milestone|--- |12.2 Resolution|---

[Bug target/103861] [i386] vectorize v2qi vectors

2022-08-18 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=103861 Uroš Bizjak changed: What|Removed |Added Resolution|--- |FIXED Status|NEW

[Bug tree-optimization/53947] [meta-bug] vectorizer missed-optimizations

2022-08-18 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=53947 Bug 53947 depends on bug 95201, which changed state. Bug 95201 Summary: Some x86 vector-extend patterns are not exercised. https://gcc.gnu.org/bugzilla/show_bug.cgi?id=95201 What|Removed |Added

[Bug tree-optimization/95201] Some x86 vector-extend patterns are not exercised.

2022-08-18 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=95201 Uroš Bizjak changed: What|Removed |Added Resolution|--- |FIXED Status|UNCONFIRMED

[Bug target/92658] x86 lacks vector extend / truncate

2022-08-18 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92658 Bug 92658 depends on bug 95201, which changed state. Bug 95201 Summary: Some x86 vector-extend patterns are not exercised. https://gcc.gnu.org/bugzilla/show_bug.cgi?id=95201 What|Removed |Added

[Bug tree-optimization/106322] tree-vectorize: Wrong code at O2 level (-fno-tree-vectorize is working)

2022-08-09 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=106322 --- Comment #22 from Uroš Bizjak --- (In reply to Martin Liška from comment #20) > Hmm, can't reproduce with x86_64 compiler with -m32: > > $ g++ --version > g++ (SUSE Linux) 12.1.1 20220721 [revision > 4f15d2234608e82159d030dadb17af678cfad626

[Bug target/81708] The x86 stack canary location should be customizable

2022-07-19 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=81708 --- Comment #16 from Uroš Bizjak --- (In reply to Alexandre Oliva from comment #15) > Uroš, > > stack-prot-sym.c fails on ia32 with PIC/PIE: the address/value of my_guard > is loaded from the GOT, instead of appearing as %gs:my_guard. > >

[Bug target/106322] i386: Wrong code at O2 level (O0 / O1 are working)

2022-07-19 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=106322 --- Comment #10 from Uroš Bizjak --- (In reply to Mathieu Malaterre from comment #9) > Technically I can also execute the `uint16` portion of the unit test and > produce a failure (so this seems to be consistent behavior with signed >

[Bug target/106180] [13 Regression] ICE in extract_insn, at recog.cc:2791 since r13-1418-g73f942c08deef3

2022-07-06 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=106180 --- Comment #6 from Uroš Bizjak --- Comment on attachment 53261 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=53261 This patch aims to handle memory issue when unpacking in cvtps2pd >@@ -9270,7 +9270,15 @@ > (vec_select:V2SF >

[Bug rtl-optimization/104637] [10/11 Regression] ICE: maximum number of LRA assignment passes is achieved (30) with -Og -fno-forward-propagate -mavx since r9-5221-gd8fcab689435a29d

2022-06-29 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=104637 Uroš Bizjak changed: What|Removed |Added Status|NEW |RESOLVED Resolution|---

[Bug target/105209] internal compiler error: in store_data_bypass_p_1

2022-06-20 Thread ubizjak at gmail dot com via Gcc-bugs
at gcc dot gnu.org |ubizjak at gmail dot com Resolution|--- |FIXED Target Milestone|--- |10.4 --- Comment #7 from Uroš Bizjak --- Fixed for gcc-10.4+

[Bug target/105970] ICE in ix86_function_arg, at config/i386/i386.cc:3351

2022-06-20 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=105970 Uroš Bizjak changed: What|Removed |Added Status|NEW |RESOLVED Target Milestone|---

[Bug target/105993] [13 Regression] ICE: RTL check: expected code 'reg', have 'subreg' in rhs_regno, at rtl.h:1932 with -O -mxop

2022-06-17 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=105993 Uroš Bizjak changed: What|Removed |Added Resolution|--- |FIXED Target Milestone|---

[Bug tree-optimization/106012] rsqrtss instruction generated even if -mno-recip specified

2022-06-17 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=106012 Uroš Bizjak changed: What|Removed |Added Ever confirmed|0 |1 Last reconfirmed|

[Bug target/105980] [11/12/13 Regression] ICE in final_scan_insn_1, at final.cc:2811

2022-06-17 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=105980 --- Comment #2 from Uroš Bizjak --- emit_move_insn in this part of ix86_output_mi_thunk: 21464 if (!sibcall_insn_operand (fnaddr, word_mode)) 21465 { 21466 tmp = gen_rtx_REG (word_mode, tmp_regno); 21467

[Bug target/105993] [13 Regression] ICE: RTL check: expected code 'reg', have 'subreg' in rhs_regno, at rtl.h:1932 with -O -mxop

2022-06-16 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=105993 --- Comment #2 from Uroš Bizjak --- Created attachment 53149 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=53149=edit Patch to fix the failure The patch fixes this particular failure by using (match_dup X). In general, rtx_equal_p

[Bug c/105970] ICE in ix86_function_arg, at config/i386/i386.cc:3351

2022-06-14 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=105970 Uroš Bizjak changed: What|Removed |Added Status|UNCONFIRMED |NEW Last reconfirmed|

[Bug target/105953] [12/13 Regression] ICE in extract_insn, at recog.cc:2791

2022-06-14 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=105953 Uroš Bizjak changed: What|Removed |Added Last reconfirmed||2022-06-14

[Bug target/105951] [12/13 Regression] ICE in emit_store_flag, at expmed.cc:6027

2022-06-14 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=105951 --- Comment #2 from Uroš Bizjak --- (In reply to Uroš Bizjak from comment #1) > CC author. g:6362627b27f395b054f359244fcfcb15ac0ac2ab

[Bug target/105951] [12/13 Regression] ICE in emit_store_flag, at expmed.cc:6027

2022-06-14 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=105951 Uroš Bizjak changed: What|Removed |Added Ever confirmed|0 |1 CC|

[Bug target/105927] ICE: RTL check: expected code 'reg', have 'mem' in rhs_regno, at rtl.h:1932 with -mtune=k6-3 -msse

2022-06-13 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=105927 Uroš Bizjak changed: What|Removed |Added Target Milestone|--- |13.0 Resolution|---

[Bug rtl-optimization/104777] [9/10 Regression] gcc crashes while compiling a custom coroutine library sample

2022-06-13 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=104777 Uroš Bizjak changed: What|Removed |Added Resolution|FIXED |--- Status|RESOLVED

[Bug rtl-optimization/104777] [9/10 Regression] gcc crashes while compiling a custom coroutine library sample

2022-06-13 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=104777 Uroš Bizjak changed: What|Removed |Added CC||stsp at users dot sourceforge.net ---

[Bug rtl-optimization/105936] [10 Regression] ICE with inline-asm and TLS on x86_64 and -O2 in move_insn

2022-06-13 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=105936 Uroš Bizjak changed: What|Removed |Added Status|UNCONFIRMED |RESOLVED Resolution|---

[Bug rtl-optimization/105936] [10 Regression] ICE with inline-asm and TLS on x86_64 and -O2 in move_insn

2022-06-13 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=105936 --- Comment #4 from Uroš Bizjak --- Digging a bit further with current gcc-10 branch... Instrumenting a TLS address splitter in i386.md with some creative printfs: (define_split [(match_operand 0 "tls_address_pattern")]

[Bug rtl-optimization/105936] [10 Regression] ICE with inline-asm and TLS on x86_64 and -O2 in move_insn

2022-06-13 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=105936 --- Comment #3 from Uroš Bizjak --- For some reason, split1 pass converts (insn): (insn 54 51 109 9 (parallel [ (asm_operands/v ("btrl %1,%0") ("") 0 [ (mem/c:BLK (plus:DI (plus:DI (unspec:DI [

[Bug target/105927] ICE: RTL check: expected code 'reg', have 'mem' in rhs_regno, at rtl.h:1932 with -mtune=k6-3 -msse

2022-06-12 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=105927 Uroš Bizjak changed: What|Removed |Added Assignee|unassigned at gcc dot gnu.org |ubizjak at gmail dot com

[Bug target/105927] ICE: RTL check: expected code 'reg', have 'mem' in rhs_regno, at rtl.h:1932 with -mtune=k6-3 -msse

2022-06-12 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=105927 Uroš Bizjak changed: What|Removed |Added Ever confirmed|0 |1 Status|UNCONFIRMED

[Bug target/105778] Shift by register --- unnecessary AND instruction

2022-05-31 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=105778 --- Comment #6 from Uroš Bizjak --- (In reply to Jakub Jelinek from comment #4) > It is the same thing done a few lines later in the preexisting code too. > Shall I all of those change to gen_lowpart (QImode, force_reg (GET_MODE >

[Bug target/105778] Shift by register --- unnecessary AND instruction

2022-05-31 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=105778 --- Comment #3 from Uroš Bizjak --- Comment on attachment 53058 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=53058 gcc13-pr105778.patch >+ operands[2] = gen_lowpart (QImode, operands[2]); We have learned that pre-reload splits

[Bug target/105624] [13 Regression] ICE in final_scan_insn_1, at final.cc:2861 (error: could not split insn)

2022-05-30 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=105624 --- Comment #8 from Uroš Bizjak --- > I think it would work to keep the constraints for > const_int_operands that are in a % pair and drop them > elsewhere. (So a partial reapplication, rather than a > full reapplication.) OK, let's throw the

[Bug target/105624] [13 Regression] ICE in final_scan_insn_1, at final.cc:2861 (error: could not split insn)

2022-05-30 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=105624 --- Comment #6 from Uroš Bizjak --- (In reply to rsand...@gcc.gnu.org from comment #5) > FWIW, I think the problem is specific to operands that are > commutative with a non-constant operand. For example, > suppose the pre-RA instruction had a

[Bug target/105624] [13 Regression] ICE in final_scan_insn_1, at final.cc:2861 (error: could not split insn)

2022-05-17 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=105624 Uroš Bizjak changed: What|Removed |Added Resolution|--- |FIXED Status|ASSIGNED

[Bug target/105624] [13 Regression] ICE in final_scan_insn_1, at final.cc:2861 (error: could not split insn)

2022-05-17 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=105624 Uroš Bizjak changed: What|Removed |Added Assignee|unassigned at gcc dot gnu.org |ubizjak at gmail dot com Ever

[Bug target/105624] [13 Regression] ICE in final_scan_insn_1, at final.cc:2861 (error: could not split insn)

2022-05-17 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=105624 --- Comment #1 from Uroš Bizjak --- Ho-hum - this was my patch that removed constraint from const_int predicates. We are talking about: (define_insn_and_split "*anddi_1_btr" [(set (match_operand:DI 0 "nonimmediate_operand" "=rm")

[Bug target/105513] [9/10/11/12/13 Regression] Unnecessary SSE spill since r9-5748-g1d4b4f4979171ef0

2022-05-10 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=105513 --- Comment #3 from Uroš Bizjak --- (In reply to Hongtao.liu from comment #2) > Just note #c4 in pr105504 also solve this issue. > > >Another possible solution is add a little bit dislike for "m" > >alternative(like ?m) to avoid potential

[Bug target/105073] [meta bug]Patch pending for GCC13.

2022-05-03 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=105073 Bug 105073 depends on bug 105079, which changed state. Bug 105079 Summary: _mm_storeu_si16 inefficiently uses pextrw to an integer reg (without SSE4.1) https://gcc.gnu.org/bugzilla/show_bug.cgi?id=105079 What|Removed

[Bug target/105079] _mm_storeu_si16 inefficiently uses pextrw to an integer reg (without SSE4.1)

2022-05-03 Thread ubizjak at gmail dot com via Gcc-bugs
|RESOLVED Target Milestone|--- |13.0 Assignee|unassigned at gcc dot gnu.org |ubizjak at gmail dot com --- Comment #3 from Uroš Bizjak --- Implemented for gcc-13.

[Bug target/105429] Unnecessary moves generated with _mm_crc32_u64

2022-04-29 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=105429 --- Comment #1 from Uroš Bizjak --- The intrinsic is defined as: unsinged __int64 _mm_crc32_u64( unsinged __int64 crc, unsigned __int64 data ) and the unnecessary move is in fact zero-extend: movl%eax, %eax # 16[c=1 l=2]

[Bug target/105073] [meta bug]Patch pending for GCC13.

2022-04-29 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=105073 Bug 105073 depends on bug 51954, which changed state. Bug 51954 Summary: __int128_t (and long long on x86) negation can be optimized https://gcc.gnu.org/bugzilla/show_bug.cgi?id=51954 What|Removed |Added

[Bug target/51954] __int128_t (and long long on x86) negation can be optimized

2022-04-29 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=51954 Uroš Bizjak changed: What|Removed |Added Status|ASSIGNED|RESOLVED Resolution|---

[Bug target/105209] internal compiler error: in store_data_bypass_p_1

2022-04-11 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=105209 --- Comment #2 from Uroš Bizjak --- Created attachment 52780 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=52780=edit Proposed patch This patch introduces alpha-specific version of store_data_bypass_p that ignores TRAP_IF that would

[Bug tree-optimization/103035] [meta-bug] YARPGen bugs

2022-04-05 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=103035 Bug 103035 depends on bug 105139, which changed state. Bug 105139 Summary: [12 Regression] GCC produces vmovw instruction with an incorrect argument for -O3 -march=sapphirerapids since r12-6215-g708b87dcb6e48cb4

[Bug target/105139] [12 Regression] GCC produces vmovw instruction with an incorrect argument for -O3 -march=sapphirerapids since r12-6215-g708b87dcb6e48cb4

2022-04-05 Thread ubizjak at gmail dot com via Gcc-bugs
at gcc dot gnu.org |ubizjak at gmail dot com Status|NEW |RESOLVED Resolution|--- |FIXED --- Comment #8 from Uroš Bizjak --- Fixed.

[Bug target/105139] [12 Regression] GCC produces vmovw instruction with an incorrect argument for -O3 -march=sapphirerapids since r12-6215-g708b87dcb6e48cb4

2022-04-04 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=105139 --- Comment #6 from Uroš Bizjak --- *movv2qi_internal was not fixed in the same way as *movhi_internal, so: diff --git a/gcc/config/i386/mmx.md b/gcc/config/i386/mmx.md index 29d470bdef2..197f19e4b1a 100644 --- a/gcc/config/i386/mmx.md +++

[Bug target/105136] [11/12 regression] Missed optimization regression with 32-bit adds and shifts

2022-04-04 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=105136 --- Comment #2 from Uroš Bizjak --- The regression in bar: is due to RA regression for: (insn 28 27 29 2 (parallel [ (set (reg:SI 89) (plus:SI (reg:SI 92) (subreg:SI (reg:DI 87) 0)))

[Bug target/105079] _mm_storeu_si16 inefficiently uses pextrw to an integer reg (without SSE4.1)

2022-03-28 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=105079 --- Comment #1 from Uroš Bizjak --- Created attachment 52700 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=52700=edit Proposed patch The attached patch handles the case from Comment #0.

[Bug target/105073] [meta bug]Patch pending for GCC13.

2022-03-28 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=105073 Uroš Bizjak changed: What|Removed |Added Ever confirmed|0 |1 Last reconfirmed|

[Bug target/104781] [12 regression] SEGV in _Unwind_GetGR during i386 Ada bootstrap

2022-03-04 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=104781 --- Comment #3 from Uroš Bizjak --- Comment on attachment 52563 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=52563 A patch >From ba4854c13c4aaa5b50127f23cb09cf05e3eb229d Mon Sep 17 00:00:00 2001 >From: "H.J. Lu" >Date: Fri, 4 Mar 2022

[Bug rtl-optimization/104664] [12 Regression] ICE: in extract_constrain_insn, at recog.cc:2670 (insn does not satisfy its constraints) with -Og -ffinite-math-only

2022-02-24 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=104664 Uroš Bizjak changed: What|Removed |Added Keywords||ra Component|target

<    1   2   3   4   5   6   7   8   9   10   >