[Bug target/96793] __builtin_floor produces wrong result when rounding direction is FE_DOWNWARD

2020-12-22 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=96793 --- Comment #19 from Uroš Bizjak --- https://gcc.gnu.org/g:337ed0eb490b14899f4049bc4c8922eb1d8a2e67 commit r11-6303-g337ed0eb490b14899f4049bc4c8922eb1d8a2e67 Author: Uros Bizjak Date: Tue Dec 22 18:13:24 2020 +0100 i386: Fix __builtin_

[Bug target/96793] __builtin_floor produces wrong result when rounding direction is FE_DOWNWARD

2020-12-22 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=96793 --- Comment #20 from Uroš Bizjak --- https://gcc.gnu.org/g:0bf0e0b86d3e2f12555479096baaf0ca7a9f7ac6 commit r10-9164-g0bf0e0b86d3e2f12555479096baaf0ca7a9f7ac6 Author: Uros Bizjak Date: Tue Dec 22 21:11:51 2020 +0100 i386: Fix __builtin_fl

[Bug target/96793] __builtin_floor produces wrong result when rounding direction is FE_DOWNWARD

2020-12-23 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=96793 --- Comment #21 from Uroš Bizjak --- https://gcc.gnu.org/g:c40b640ebcef1aae78eaca56e04d204dda9e4cad commit r9-9126-gc40b640ebcef1aae78eaca56e04d204dda9e4cad Author: Uros Bizjak Date: Wed Dec 23 09:09:29 2020 +0100 i386: Fix __builtin_f

[Bug target/96793] __builtin_floor produces wrong result when rounding direction is FE_DOWNWARD

2020-12-23 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=96793 --- Comment #22 from Uroš Bizjak --- https://gcc.gnu.org/g:edb28850520d1137d12a1cc1c0e89c11e6b0c6ef commit r8-10691-gedb28850520d1137d12a1cc1c0e89c11e6b0c6ef Author: Uros Bizjak Date: Wed Dec 23 09:18:12 2020 +0100 i386: Fix __builtin_

[Bug target/96793] __builtin_floor produces wrong result when rounding direction is FE_DOWNWARD

2020-12-23 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=96793 Uroš Bizjak changed: What|Removed |Added Resolution|--- |FIXED Status|ASSIGNED

[Bug target/92658] x86 lacks vector extend / truncate

2020-12-25 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92658 --- Comment #22 from Uroš Bizjak --- (In reply to Hongtao.liu from comment #21) > Add define_code_attr like aarch64/iterators.md? > > -- > ;; Map rtl objects to optab names > (define_code_attr optab [(ashift "ashl") >

[Bug rtl-optimization/98439] [11 Regression] ICE in final_scan_insn_1, at final.c:3096

2020-12-27 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=98439 Uroš Bizjak changed: What|Removed |Added Component|target |rtl-optimization Last reconfirmed|

[Bug rtl-optimization/97684] [11 Regression] ICE in reg_preferred_class, at reginfo.c:789

2020-12-27 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=97684 Uroš Bizjak changed: What|Removed |Added Component|target |rtl-optimization Ever confirmed|0

[Bug rtl-optimization/98439] [11 Regression] ICE in final_scan_insn_1, at final.c:3096

2020-12-28 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=98439 --- Comment #3 from Uroš Bizjak --- I don't think this is a backend bug. The position of split pass in the pass sequence assumes that no split candidates will be emitted after regstack, as can be seen from the gate function of the pass_split_for_

[Bug target/64243] Passing and returning structures with single member of floating type via SSE registers is wrong on Windows x86-64 ABI

2020-12-30 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=64243 Uroš Bizjak changed: What|Removed |Added CC||10walls at gmail dot com --- Comment #4 fr

[Bug target/64243] Passing and returning structures with single member of floating type via SSE registers is wrong on Windows x86-64 ABI

2020-12-30 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=64243 --- Comment #5 from Uroš Bizjak --- This is fixed in gcc-11: --cut here-- struct TestFloat { float x; }; struct TestDouble { double x; }; struct TestFloat foo (struct TestFloat x) { return x; } struct TestDouble bar (struct TestDouble x) { retu

[Bug target/64243] Passing and returning structures with single member of floating type via SSE registers is wrong on Windows x86-64 ABI

2020-12-30 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=64243 Uroš Bizjak changed: What|Removed |Added Status|UNCONFIRMED |RESOLVED Known to work|

[Bug target/98522] _mm_cvttps_pi32 and _mm_cvtps_pi32 raise spurious FP exceptions

2021-01-05 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=98522 Uroš Bizjak changed: What|Removed |Added Status|UNCONFIRMED |ASSIGNED Last reconfirmed|

[Bug target/98521] [x86] _mm256_cmov_si256 XOP function is missing

2021-01-05 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=98521 Uroš Bizjak changed: What|Removed |Added Ever confirmed|0 |1 Assignee|unassigned at gcc dot

[Bug target/98521] [x86] _mm256_cmov_si256 XOP function is missing

2021-01-06 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=98521 Uroš Bizjak changed: What|Removed |Added Resolution|--- |FIXED Status|ASSIGNED

[Bug target/98522] _mm_cvttps_pi32 and _mm_cvtps_pi32 raise spurious FP exceptions

2021-01-06 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=98522 Uroš Bizjak changed: What|Removed |Added Target Milestone|--- |10.3 Status|ASSIGNED

[Bug target/98567] Failure to optimize using ZF flag from blsi

2021-01-06 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=98567 --- Comment #2 from Uroš Bizjak --- Comment on attachment 49901 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=49901 gcc11-pr98567.patch >+(define_insn "*bmi_blsi__cmp" >+ [(set (reg:CCZ FLAGS_REG) >+ (compare:CCZ >+(and:SWI4

[Bug target/98482] -mfentry creates invalid call for -mcmodel=large

2021-01-07 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=98482 --- Comment #2 from Uroš Bizjak --- (In reply to Hongtao.liu from comment #1) > and by the time of output __fentry__ in gcc, register is already accocated, > is there any regs supposed to be safe in the entry of function? or we need > to spill re

[Bug target/98482] -mfentry creates invalid call for -mcmodel=large

2021-01-07 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=98482 --- Comment #3 from Uroš Bizjak --- (In reply to Uroš Bizjak from comment #2) > (In reply to Hongtao.liu from comment #1) > > and by the time of output __fentry__ in gcc, register is already accocated, > > is there any regs supposed to be safe in

[Bug target/98482] -mfentry creates invalid call for -mcmodel=large

2021-01-07 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=98482 --- Comment #5 from Uroš Bizjak --- (In reply to Topi Miettinen from comment #4) > Sorry, I didn't check the ABI. It seems that %r11 and maybe %r10 should be > usable: %r11 is already used as PROFILE_COUNT_REGISTER for !NO_PROFILE_COUNTERS targe

[Bug target/98482] -mfentry creates invalid call for -mcmodel=large

2021-01-08 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=98482 --- Comment #9 from Uroš Bizjak --- (In reply to Topi Miettinen from comment #8) > I'm unfortunately ignorant to GCC internals and usage of %r10, but otherwise > the patch looks good to me. > > For -mcmodel=large -fPIC, the call sequence probabl

[Bug target/98482] -mfentry creates invalid call for -mcmodel=large

2021-01-08 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=98482 --- Comment #14 from Uroš Bizjak --- (In reply to Jakub Jelinek from comment #10) > If we are emitting for nested functions > pushq %r10 > 1:call__fentry__ > popq%r10 > (is it ok to misalign the stack for __fentry__? but

[Bug target/98671] gcc/config/i386/i386-options.c:787:redundantAssignment

2021-01-14 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=98671 Uroš Bizjak changed: What|Removed |Added Status|UNCONFIRMED |ASSIGNED Ever confirmed|0

[Bug target/98671] gcc/config/i386/i386-options.c:787:redundantAssignment

2021-01-14 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=98671 Uroš Bizjak changed: What|Removed |Added Resolution|--- |FIXED Status|ASSIGNED

[Bug target/98683] Non-canonical compare produced with the VAX backend

2021-01-14 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=98683 --- Comment #1 from Uroš Bizjak --- Maybe TARGET_CANONICALIZE_COMPARISON would help here? x86 had a similar issue with ficom x87 insn where float RTX was always the first operand, but the compare was with the float extend of the second one.

[Bug target/98671] gcc/config/i386/i386-options.c:787:redundantAssignment

2021-01-14 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=98671 --- Comment #6 from Uroš Bizjak --- (In reply to David Binderman from comment #5) > (In reply to Uroš Bizjak from comment #4) > > I'm not sure if solving this would bring us anything. > > For clarity, at very most a 4% reduction in the size of t

[Bug tree-optimization/96674] Failure to optimize combination of comparisons to dec+compare

2021-01-14 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=96674 --- Comment #8 from Uroš Bizjak --- Comment on attachment 49969 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=49969 Optimize combination of comparisons to dec+compare >+/* y == XXX_MIN || x < y --> x <= y - 1 */ Can we use TYPE_MIN inste

[Bug middle-end/98713] Failure to generate branch version of abs if user requested it

2021-01-18 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=98713 --- Comment #4 from Uroš Bizjak --- Please see PR 56309 (and PR 85559 meta bug). Quote from Honza: The decision on whether to use cmov or jmp was always tricky on x86 architectures. Cmov increase dependency chains, register pressure (both value

[Bug ada/98724] [11 Regression] gnat build failure on alpha-linux-gnu

2021-01-18 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=98724 --- Comment #1 from Uroš Bizjak --- Sorry, I don't have access to alpha anymore. (And I'm surprised that gnat even builds, because I've never tried.)

[Bug target/98612] _mm_comieq_sd has wrong semantics

2021-01-18 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=98612 --- Comment #8 from Uroš Bizjak --- (In reply to Hongtao.liu from comment #7) > I asked my colleagues within intel to revise the descriptions in the > intrinsics guide to make it more explicit about NAN operands. > > I'll fix this issue after th

[Bug target/98737] Atomic operation on x86 no optimized to use flags

2021-01-19 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=98737 --- Comment #2 from Uroš Bizjak --- This can be optimized with peephole2, we already have similar case in sync.md: ;; This peephole2 and following insn optimize ;; __sync_fetch_and_add (x, -N) == N into just lock {add,sub,inc,dec} ;; followed by

[Bug rtl-optimization/98961] Failure to optimize successive comparisons with 0 into clz

2021-02-04 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=98961 Uroš Bizjak changed: What|Removed |Added CC||ubizjak at gmail dot com --- Comment #2 fr

[Bug rtl-optimization/98961] Failure to optimize successive comparisons with 0 into clz

2021-02-04 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=98961 --- Comment #3 from Uroš Bizjak --- Please note that LZCNT insn has it own set of problems (e.g. TARGET_AVOID_FALSE_DEP_FOR_BMI), so I'm not convinced that even: int z (int i) { return i == 0; } benefits from using LZCNT: 0: 31 c0

[Bug rtl-optimization/98962] Perform bitops on floats directly with SSE

2021-02-04 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=98962 --- Comment #4 from Uroš Bizjak --- (In reply to Jakub Jelinek from comment #3) > Another possibility is add x/v constraints to *andsi_1 and *anddi_1 with the > immediates and disparage that alternative enough to reflect the fact that > the immed

[Bug target/99025] [11 Regression] ICE Segmentation fault since r11-6351-g12ae2bc70846a2be

2021-02-10 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=99025 --- Comment #2 from Uroš Bizjak --- Comment on attachment 50154 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=50154 gcc11-pr99025.patch >2021-02-09 Jakub Jelinek >+ if (SUBREG_P (operands[1])) >+operands[1] = force_reg (V2SFmode,

[Bug target/99083] Big run-time regressions of 519.lbm_r with LTO

2021-02-13 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=99083 --- Comment #1 from Uroš Bizjak --- This should be a no-op. According to the documentation: --q-- Macro: REG_ALLOC_ORDER If defined, an initializer for a vector of integers, containing the numbers of hard registers in the order in which GCC

[Bug target/99083] Big run-time regressions of 519.lbm_r with LTO

2021-02-15 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=99083 --- Comment #3 from Uroš Bizjak --- It looks to me another one is in reload1.c, find_reg: if (this_cost < best_cost /* Among registers with equal cost, prefer caller-saved ones, or use REG_ALLOC_ORDER if

[Bug target/99083] Big run-time regressions of 519.lbm_r with LTO

2021-02-15 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=99083 --- Comment #4 from Uroš Bizjak --- Created attachment 50185 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=50185&action=edit Proposed patch Proposed patch that fixes ira-color.c and introduces HONOR_REG_ALLOC_ORDER.

[Bug target/99083] Big run-time regressions of 519.lbm_r with LTO

2021-02-15 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=99083 --- Comment #5 from Uroš Bizjak --- Martin, can you please benchmark the patch from Comment #4? The patch is not totally trivial, because it introduces HONOR_REG_ALLOC_ORDER to x86 and this define disables some other code in ira-color.c, assign_

[Bug target/99083] Big run-time regressions of 519.lbm_r with LTO

2021-02-15 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=99083 --- Comment #6 from Uroš Bizjak --- As a side note, it is strange that ADJUST_REG_ALLOC_ORDER somehow require REG_ALLOC_ORDER to be defined (c.f. Comment #3), while its documentation says: The macro body should not assume anything about the

[Bug target/99083] Big run-time regressions of 519.lbm_r with LTO

2021-02-15 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=99083 --- Comment #8 from Uroš Bizjak --- (In reply to Richard Biener from comment #7) > Btw, for GCC 11 it might be tempting to simply revert the "no-op" change? I agree, this is the safest way at this time. The situation now looks like going into ra

[Bug target/99083] Big run-time regressions of 519.lbm_r with LTO

2021-02-15 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=99083 --- Comment #10 from Uroš Bizjak --- (In reply to Richard Biener from comment #7) > There are a lot of targets that define REG_ALLOC_ORDER ^ > HONOR_REG_ALLOC_ORDER and thus are affected by this change... The following patch should solve this is

[Bug target/99115] ICE in extract_insn, at recog.c:2309 on alpha (error: unrecognizable insn) with -O2

2021-02-16 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=99115 Uroš Bizjak changed: What|Removed |Added Known to work||11.0 --- Comment #3 from Uroš Bizjak ---

[Bug target/99115] ICE in extract_insn, at recog.c:2309 on alpha (error: unrecognizable insn) with -O2

2021-02-17 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=99115 --- Comment #4 from Uroš Bizjak --- Compiles OK with: GNU C++14 (GCC) version 8.4.1 20210216 [releases/gcc-8 revision c6513400d84:39c49bc104d:1f3a07da9b6bcfa4733750826746bd18ac6f20db] (alpha-unknown-openbsd6.8) built as a cross from x86_64-linu

[Bug target/99083] Big run-time regressions of 519.lbm_r with LTO

2021-02-21 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=99083 Uroš Bizjak changed: What|Removed |Added Resolution|--- |FIXED Status|NEW

[Bug target/99083] Big run-time regressions of 519.lbm_r with LTO

2021-02-21 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=99083 Uroš Bizjak changed: What|Removed |Added Target Milestone|--- |11.0

[Bug middle-end/26163] [meta-bug] missed optimization in SPEC (2k17, 2k and 2k6 and 95)

2021-02-21 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=26163 Bug 26163 depends on bug 99083, which changed state. Bug 99083 Summary: Big run-time regressions of 519.lbm_r with LTO https://gcc.gnu.org/bugzilla/show_bug.cgi?id=99083 What|Removed |Added -

[Bug target/99083] Big run-time regressions of 519.lbm_r with LTO

2021-02-25 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=99083 Uroš Bizjak changed: What|Removed |Added Status|RESOLVED|REOPENED Keywords|

[Bug middle-end/26163] [meta-bug] missed optimization in SPEC (2k17, 2k and 2k6 and 95)

2021-02-25 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=26163 Bug 26163 depends on bug 99083, which changed state. Bug 99083 Summary: Big run-time regressions of 519.lbm_r with LTO https://gcc.gnu.org/bugzilla/show_bug.cgi?id=99083 What|Removed |Added -

[Bug target/82735] _mm256_zeroupper does not invalidate previously computed registers

2021-04-28 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=82735 --- Comment #8 from Uroš Bizjak --- (In reply to Hongtao.liu from comment #7) > Confirmed, let me fix this. Please note that the current definition of vzeroupper does not model effects of the instruction at all. The current definition is intende

[Bug target/82735] _mm256_zeroupper does not invalidate previously computed registers

2021-04-28 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=82735 --- Comment #9 from Uroš Bizjak --- (In reply to Richard Biener from comment #4) > Indeed as far as I understand an unspec volatile isn't sth clobbering > registers (not even memory?!). The insn is missing inputs/outputs > (we might be able to m

[Bug target/82735] _mm256_zeroupper does not invalidate previously computed registers

2021-04-28 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=82735 --- Comment #11 from Uroš Bizjak --- (In reply to Uroš Bizjak from comment #9) > (In reply to Richard Biener from comment #4) > > Indeed as far as I understand an unspec volatile isn't sth clobbering > > registers (not even memory?!). The insn i

[Bug target/100182] [8/9/10/11/12 Regression] Miscompilation of atomic_float/1.cc and atomic_float/wait_notify.cc on i686

2021-04-28 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100182 Uroš Bizjak changed: What|Removed |Added Resolution|--- |FIXED Status|ASSIGNED

[Bug target/100312] __builtin_ia32_maskloadpd256 and friends should be pure

2021-04-29 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100312 Uroš Bizjak changed: What|Removed |Added Assignee|rguenth at gcc dot gnu.org |ubizjak at gmail dot com

[Bug target/98060] Failure to optimize cmp+setnb+add to cmp+sbb

2021-04-30 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=98060 Uroš Bizjak changed: What|Removed |Added Status|ASSIGNED|RESOLVED Resolution|---

[Bug other/98375] [meta bug] GCC 12 pending patches

2021-04-30 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=98375 Bug 98375 depends on bug 98060, which changed state. Bug 98060 Summary: Failure to optimize cmp+setnb+add to cmp+sbb https://gcc.gnu.org/bugzilla/show_bug.cgi?id=98060 What|Removed |Added ---

[Bug testsuite/100355] gcc.c-torture/execute/ieee/cdivchkld.c needs fmaxl

2021-05-03 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100355 --- Comment #3 from Uroš Bizjak --- (In reply to Christophe Lyon from comment #2) > Tried that, but it's not taken into account. > > ieee.exp uses c-torture-execute, maybe that function does not honor dg > directives? (none of the tests under i

[Bug rtl-optimization/100342] [10/11 Regression] wrong code with -O2 -fno-dse -fno-forward-propagate -mno-sse2

2021-05-04 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100342 --- Comment #3 from Uroš Bizjak --- For some reason the *input* value at BSWAP insn is truncated to 32bits. v256u128 v256u128_1 = SHLV (SHLSV (__builtin_bswap64 (u128_0), (v256u128) (0 < v256u128_0)) <= 0, v256u128_0); u128_0 i

[Bug rtl-optimization/100342] [10/11 Regression] wrong code with -O2 -fno-dse -fno-forward-propagate -mno-sse2

2021-05-04 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100342 --- Comment #4 from Uroš Bizjak --- The problematic insn is: 401cec: 44 89 f6mov%r14d,%esi This one should be 64 bit wide, movl%r14d, %esi # 613 [c=4 l=3] *movqi_internal/2 but is actually a QIm

[Bug rtl-optimization/100342] [10/11 Regression] wrong code with -O2 -fno-dse -fno-forward-propagate -mno-sse2

2021-05-04 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100342 --- Comment #5 from Uroš Bizjak --- The problem can be seen in _.pro_and_epilogue pass: Starting with: _.cmpelim 2741: r14:DI=[sp:DI+0x38] ... 368: di:DI=r14:DI ... 613: si:QI=r14:QI ... 2737: bp:DI=r14:DI ... 658: strict_low_

[Bug rtl-optimization/100342] [10/11 Regression] wrong code with -O2 -fno-dse -fno-forward-propagate -mno-sse2

2021-05-04 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100342 --- Comment #7 from Uroš Bizjak --- I have traced a bit where (insn 2275) and (insn 2287) come from. In _.ira, we have: 613: r125:QI=r2067:DI#0 ... 659: zero_extract(r2080:DI,0x8,0x8)=r125:QI#0 And in _.reload, a DImode reload is insert

[Bug rtl-optimization/100342] [10/11 Regression] wrong code with -O2 -fno-dse -fno-forward-propagate -mno-sse2

2021-05-04 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100342 --- Comment #8 from Uroš Bizjak --- FYI, this whole analysis was done with Fedora 33 system compiler: gcc version 10.3.1 20210422 (Red Hat 10.3.1-1) (GCC)

[Bug other/98375] [meta bug] GCC 12 pending patches

2021-05-05 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=98375 Bug 98375 depends on bug 98218, which changed state. Bug 98218 Summary: [TARGET_MMX_WITH_SSE] Miss vec_cmpmn/vcondmn expander for 64bit vector https://gcc.gnu.org/bugzilla/show_bug.cgi?id=98218 What|Removed |Ad

[Bug target/98218] [TARGET_MMX_WITH_SSE] Miss vec_cmpmn/vcondmn expander for 64bit vector

2021-05-05 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=98218 Uroš Bizjak changed: What|Removed |Added Status|ASSIGNED|RESOLVED Resolution|---

[Bug target/100445] [12 Regression] ice during RTL pass: vregs

2021-05-06 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100445 --- Comment #5 from Uroš Bizjak --- ix86_expand_sse_movcc has special TARGET_XOP path, so the following patch is needed: diff --git a/gcc/config/i386/mmx.md b/gcc/config/i386/mmx.md index 347295afbb5..667dd057e0d 100644 --- a/gcc/config/i386/mm

[Bug target/100445] [12 Regression] ice during RTL pass: vregs

2021-05-06 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100445 --- Comment #6 from Uroš Bizjak --- (In reply to Uroš Bizjak from comment #5) > ix86_expand_sse_movcc has special TARGET_XOP path, so the following patch is > needed: Ah, you beat me by the second ;) Anyway, I have no XOP target, so probably y

[Bug target/100445] [12 Regression] ice during RTL pass: vregs

2021-05-06 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100445 --- Comment #9 from Uroš Bizjak --- ix86_use_mask_cmp_p should be refined, it has an early return for 64bit modes: if (GET_MODE_SIZE (mode) == 64) return true;

[Bug target/100445] [12 Regression] ice during RTL pass: vregs

2021-05-06 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100445 --- Comment #10 from Uroš Bizjak --- Following patch fixes the failures: --cut here-- diff --git a/gcc/config/i386/i386-expand.c b/gcc/config/i386/i386-expand.c index 4dfe7d6c282..61b2f921f41 100644 --- a/gcc/config/i386/i386-expand.c +++ b/gcc

[Bug target/100461] [11/12 Regression] mingw build broken due to change of rdtsc implementation

2021-05-06 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100461 Uroš Bizjak changed: What|Removed |Added CC||hjl.tools at gmail dot com --- Comment #1

[Bug target/98218] [TARGET_MMX_WITH_SSE] Implement 64bit vector compares (AVX512 masked compares missing)

2021-05-11 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=98218 Uroš Bizjak changed: What|Removed |Added Summary|[TARGET_MMX_WITH_SSE] Miss |[TARGET_MMX_WITH_SSE] |v

[Bug other/98375] [meta bug] GCC 12 pending patches

2021-05-11 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=98375 Bug 98375 depends on bug 98218, which changed state. Bug 98218 Summary: [TARGET_MMX_WITH_SSE] Implement 64bit vector compares (AVX512 masked compares missing) https://gcc.gnu.org/bugzilla/show_bug.cgi?id=98218 What|Removed

[Bug target/98218] [TARGET_MMX_WITH_SSE] Implement 64bit vector compares (AVX512 masked compares missing)

2021-05-11 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=98218 Uroš Bizjak changed: What|Removed |Added Assignee|ubizjak at gmail dot com |unassigned at gcc dot gnu.org ---

[Bug target/98218] [TARGET_MMX_WITH_SSE] Implement 64bit vector compares (AVX512 masked compares missing)

2021-05-12 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=98218 --- Comment #12 from Uroš Bizjak --- (In reply to David Binderman from comment #11) > I might be seeing something similar: > > caxcpy.f: In function 'caxcpy': > caxcpy.f:53:72: error: unrecognizable insn: >53 | end subroutine > |

[Bug target/98218] [TARGET_MMX_WITH_SSE] Implement 64bit vector compares (AVX512 masked compares missing)

2021-05-12 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=98218 --- Comment #13 from Uroš Bizjak --- (In reply to Uroš Bizjak from comment #12) > Yeah, this is a non-existent SSE "cmove". I tried to find all paths where > this should divert to a sequence of logic instructions or PBLENDB, but due > to plethora

[Bug target/100581] [12 Regression] ICE in extract_insn, at recog.c:2770 since r12-731-gb1f7fd8a2a5558da

2021-05-13 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100581 Uroš Bizjak changed: What|Removed |Added Assignee|unassigned at gcc dot gnu.org |ubizjak at gmail dot com

[Bug target/100581] [12 Regression] ICE in extract_insn, at recog.c:2770 since r12-731-gb1f7fd8a2a5558da

2021-05-13 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100581 --- Comment #3 from Uroš Bizjak --- (In reply to Alex Coplan from comment #1) > Is it valid to create a vector type with total size less than the element > size? Shouldn't this be rejected? No, the generated code is: vmovq ff_b(%rip)

[Bug target/100581] [12 Regression] ICE in extract_insn, at recog.c:2770 since r12-731-gb1f7fd8a2a5558da

2021-05-13 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100581 Uroš Bizjak changed: What|Removed |Added Status|ASSIGNED|RESOLVED Resolution|---

[Bug target/98218] [TARGET_MMX_WITH_SSE] Implement 64bit vector compares (AVX512 masked compares missing)

2021-05-13 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=98218 --- Comment #16 from Uroš Bizjak --- (In reply to David Binderman from comment #15) > Bug first appears sometime between git hash 21dfb22920ce32fc, > dated yesterday and git hash 097fde5e7514e909, dated today. Fixed by PR100581.

[Bug target/100626] [11/12 Regression] ICE Segmentation fault (during RTL pass: split1) since r11-165-geb72dc663e9070b2

2021-05-17 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100626 --- Comment #3 from Uroš Bizjak --- *di3_doubleword calls split_double_mode with: op0: (subreg:DI (reg/v:SI 89 [ li_18 ]) 0) op1: (reg:DI 90 [ uc_4 ]) op2: (mem/c:DI (plus:SI (reg/f:SI 19 frame) (const_int -4 [0xfffc]))

[Bug target/100637] New: [i386] Vectorize 4-byte vectors

2021-05-17 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100637 Bug ID: 100637 Summary: [i386] Vectorize 4-byte vectors Product: gcc Version: 12.0 Status: UNCONFIRMED Severity: normal Priority: P3 Component: target

[Bug target/100637] [i386] Vectorize 4-byte vectors

2021-05-17 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100637 Uroš Bizjak changed: What|Removed |Added Status|UNCONFIRMED |ASSIGNED Assignee|unassigned at

[Bug target/100626] [11/12 Regression] ICE Segmentation fault (during RTL pass: split1) since r11-165-geb72dc663e9070b2

2021-05-17 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100626 Uroš Bizjak changed: What|Removed |Added Assignee|unassigned at gcc dot gnu.org |ubizjak at gmail dot com

[Bug tree-optimization/100696] New: mult_higpart is not vectorized

2021-05-20 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100696 Bug ID: 100696 Summary: mult_higpart is not vectorized Product: gcc Version: 12.0 Status: UNCONFIRMED Severity: normal Priority: P3 Component: tree-optimization

[Bug target/100701] [12 Regression] wrong code with -O -fschedule-insns2

2021-05-20 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100701 Uroš Bizjak changed: What|Removed |Added Status|NEW |ASSIGNED Assignee|unassigned at

[Bug target/100701] [12 Regression] wrong code with -O -fschedule-insns2

2021-05-20 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100701 Uroš Bizjak changed: What|Removed |Added Resolution|--- |FIXED Status|ASSIGNED

[Bug tree-optimization/100696] mult_higpart is not vectorized

2021-05-20 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100696 Uroš Bizjak changed: What|Removed |Added CC||rsandifo at gcc dot gnu.org --- Comment #

[Bug c/100722] ice in extract_insn, at recog.c:2770

2021-05-22 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100722 Uroš Bizjak changed: What|Removed |Added Last reconfirmed||2021-05-22 Target Milestone|---

[Bug target/100722] [12 Regression] ice in extract_insn with many vector_size(4) arguments

2021-05-23 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100722 Uroš Bizjak changed: What|Removed |Added Status|ASSIGNED|RESOLVED Resolution|---

[Bug target/100626] [11/12 Regression] ICE Segmentation fault (during RTL pass: split1) since r11-165-geb72dc663e9070b2

2021-05-25 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100626 Uroš Bizjak changed: What|Removed |Added Status|ASSIGNED|RESOLVED Resolution|---

[Bug middle-end/100880] New: The build should error out for define_insn without insn template

2021-06-02 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100880 Bug ID: 100880 Summary: The build should error out for define_insn without insn template Product: gcc Version: 12.0 Status: UNCONFIRMED Severity: normal

[Bug target/100936] New: %p and %P modifiers should not emit segment overrides

2021-06-06 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100936 Bug ID: 100936 Summary: %p and %P modifiers should not emit segment overrides Product: gcc Version: 12.0 Status: UNCONFIRMED Severity: normal Priority: P3 Compo

[Bug target/100936] %p and %P modifiers should not emit segment overrides

2021-06-06 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100936 --- Comment #1 from Uroš Bizjak --- Proposed patch: --cut here-- diff --git a/gcc/config/i386/i386.c b/gcc/config/i386/i386.c index 04649b42122..0773a4a9ba8 100644 --- a/gcc/config/i386/i386.c +++ b/gcc/config/i386/i386.c @@ -13531,7 +13531,7 @

[Bug target/43526] ICE: in construct_container, at config/i386/i386.c:5733 with -m96bit-long-double at x86_64-linux

2021-06-08 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=43526 Uroš Bizjak changed: What|Removed |Added Resolution|--- |INVALID Status|UNCONFIRMED

[Bug target/100936] %p and %P modifiers should not emit segment overrides

2021-06-09 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100936 Uroš Bizjak changed: What|Removed |Added Resolution|--- |FIXED Assignee|unassigned at gc

[Bug target/101021] New: PSHUFB is emitted instead of PSHUFD, PSHUFLW and PSHUFHW with -msse4

2021-06-10 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=101021 Bug ID: 101021 Summary: PSHUFB is emitted instead of PSHUFD, PSHUFLW and PSHUFHW with -msse4 Product: gcc Version: 12.0 Status: UNCONFIRMED Severity: normal

[Bug target/101021] PSHUFB is emitted instead of PSHUFD, PSHUFLW and PSHUFHW with -msse4

2021-06-11 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=101021 Uroš Bizjak changed: What|Removed |Added Last reconfirmed||2021-06-11 Ever confirmed|0

[Bug target/101023] [11/12 Regression] wrong code with -mstackrealign

2021-06-11 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=101023 Uroš Bizjak changed: What|Removed |Added CC||hjl.tools at gmail dot com --- Comment #1

[Bug target/101021] PSHUFB is emitted instead of PSHUFD, PSHUFLW and PSHUFHW with -msse4

2021-06-11 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=101021 Uroš Bizjak changed: What|Removed |Added Resolution|--- |FIXED Status|ASSIGNED

[Bug rtl-optimization/101044] -ABS(A) produces two neg instructions

2021-06-13 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=101044 --- Comment #1 from Uroš Bizjak --- The first neg also sets sign flag (SF) for the following CMOVS.

[Bug target/101058] [12 Regression] ICE in extract_insn, at recog.c:2770 since r12-1215-g8d7dae0eb366a88a

2021-06-14 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=101058 Uroš Bizjak changed: What|Removed |Added Last reconfirmed||2021-06-14 Assignee|unassigned

<    1   2   3   4   5   6   7   8   9   10   >