[Bug target/112943] [14 Regression] ICE: in gen_reg_rtx, at emit-rtl.cc:1176 with -O2 -march=westmere -mapxf

2023-12-11 Thread wwwhhhyyy333 at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=112943 Hongyu Wang changed: What|Removed |Added CC||wwwhhhyyy333 at gmail dot com

[Bug middle-end/112824] Stack spills and vector splitting with vector builtins

2023-12-06 Thread wwwhhhyyy333 at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=112824 Hongyu Wang changed: What|Removed |Added CC||wwwhhhyyy333 at gmail dot com

[Bug testsuite/112729] gcc.target/i386/apx-interrupt-1.c etc. FAIL

2023-11-28 Thread wwwhhhyyy333 at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=112729 --- Comment #7 from Hongyu Wang --- (In reply to r...@cebitec.uni-bielefeld.de from comment #5) > > Is there a reason to have -fomit-frame-pointer once before and once > after -mapx-features=push2pop2? Ah, thanks for pointing that out. Will

[Bug testsuite/112729] gcc.target/i386/apx-interrupt-1.c etc. FAIL

2023-11-27 Thread wwwhhhyyy333 at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=112729 --- Comment #3 from Hongyu Wang --- Created attachment 56703 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=56703=edit A patch Hi Rainer, can you help verify if the change make these test pass on solaris/FreeBSD?

[Bug testsuite/112729] gcc.target/i386/apx-interrupt-1.c etc. FAIL

2023-11-27 Thread wwwhhhyyy333 at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=112729 --- Comment #2 from Hongyu Wang --- The cfi scan fails was caused by -fno-omit-frame-pointer which force push the frame pointer first and the cfi info become different. By default we have -fomit-frame-pointer on linux, but not other targets.

[Bug target/112394] ICE: in extract_constrain_insn, at recog.cc:2705 insn does not satisfy its constraints: {*vec_extractv2di_1} with -O -mavx512vbmi2 -mapxf -mno-sse4.2

2023-11-07 Thread wwwhhhyyy333 at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=112394 --- Comment #2 from Hongyu Wang --- Should be fixed.

[Bug tree-optimization/112325] New: Missed vectorization after cunrolli

2023-10-31 Thread wwwhhhyyy333 at gmail dot com via Gcc-bugs
-optimization Assignee: unassigned at gcc dot gnu.org Reporter: wwwhhhyyy333 at gmail dot com Target Milestone: --- testcase: #include #include typedef struct { float s; int8_t qs[32]; } block; void foo (const int n, float * restrict s, const int8_t q[4], const block

[Bug target/111127] [13/14 regression] Wrong code for avx512ne2ps2bf16_maskz intrinsics since gcc13

2023-08-24 Thread wwwhhhyyy333 at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=27 Hongyu Wang changed: What|Removed |Added Status|UNCONFIRMED |RESOLVED Resolution|---

[Bug target/111127] New: Wrong code for avx512ne2ps2bf16_maskz intrinsics since gcc13

2023-08-24 Thread wwwhhhyyy333 at gmail dot com via Gcc-bugs
Priority: P3 Component: target Assignee: unassigned at gcc dot gnu.org Reporter: wwwhhhyyy333 at gmail dot com Target Milestone: --- cat test.c #include __m512bh cvttest(__mmask32 k, __m512 a, __m512 b

[Bug rtl-optimization/110215] RA fails to allocate register when loop invariant lives across calls and eh

2023-06-27 Thread wwwhhhyyy333 at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=110215 --- Comment #6 from Hongyu Wang --- Thanks for the fix, now for the attached test, main loop will not have any load. There is a remaining issue that the loop epilogue still contains load from stack and constant pool .L9: movslq

[Bug lto/110424] New: Bogus ODR warning for FMV member function with -flto

2023-06-26 Thread wwwhhhyyy333 at gmail dot com via Gcc-bugs
Component: lto Assignee: unassigned at gcc dot gnu.org Reporter: wwwhhhyyy333 at gmail dot com CC: marxin at gcc dot gnu.org Target Milestone: --- cat m1.h --- #pragma once class A { public: int foo1(); }; --- cat m1.cpp

[Bug rtl-optimization/110215] New: RA fails to allocate register when loop invariant lives through EH region

2023-06-12 Thread wwwhhhyyy333 at gmail dot com via Gcc-bugs
: normal Priority: P3 Component: rtl-optimization Assignee: unassigned at gcc dot gnu.org Reporter: wwwhhhyyy333 at gmail dot com Target Milestone: --- Created attachment 55305 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=55305=edit A Testcase Compi

[Bug libstdc++/110138] Extra constructor called when using basic_string::operator+

2023-06-08 Thread wwwhhhyyy333 at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=110138 Hongyu Wang changed: What|Removed |Added Resolution|--- |INVALID Status|UNCONFIRMED

[Bug libstdc++/110138] Extra constructor called when using basic_string::operator+

2023-06-06 Thread wwwhhhyyy333 at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=110138 --- Comment #1 from Hongyu Wang --- operator+ now calls std::__cxx11::basic_string, myAlloc_ >::get_allocator, and it will call the constructor again after gimplify __attribute__((nodiscard)) struct allocator_type std::__cxx11::basic_string,

[Bug libstdc++/110138] New: Extra constructor called when using basic_string::operator+

2023-06-06 Thread wwwhhhyyy333 at gmail dot com via Gcc-bugs
Priority: P3 Component: libstdc++ Assignee: unassigned at gcc dot gnu.org Reporter: wwwhhhyyy333 at gmail dot com Target Milestone: --- Created attachment 55268 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=55268=edit Simplified test complied with -std=c+

[Bug libgomp/109062] [13 regression] Default value of GOMP_SPINCOUNT changes since r13-2545

2023-03-08 Thread wwwhhhyyy333 at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=109062 Hongyu Wang changed: What|Removed |Added Status|UNCONFIRMED |RESOLVED Resolution|---

[Bug libgomp/109062] New: [13 regression] Default value of GOMP_SPINCOUNT changes since r13-2545

2023-03-07 Thread wwwhhhyyy333 at gmail dot com via Gcc-bugs
Priority: P3 Component: libgomp Assignee: unassigned at gcc dot gnu.org Reporter: wwwhhhyyy333 at gmail dot com CC: jakub at gcc dot gnu.org Target Milestone: --- Recently we found several big regressions on Phoronix OpenMP benchmark

[Bug target/107692] [13 regression] r13-3950-g071e428c24ee8c breaks many test cases

2022-11-23 Thread wwwhhhyyy333 at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=107692 --- Comment #12 from Hongyu Wang --- Fixed for GCC 13. Sorry for introducing this.

[Bug target/107692] [13 regression] r13-3950-g071e428c24ee8c breaks many test cases

2022-11-18 Thread wwwhhhyyy333 at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=107692 --- Comment #9 from Hongyu Wang --- (In reply to Segher Boessenkool from comment #8) > (In reply to Jiu Fu Guo from comment #5) > > > -munroll-only-small-loops does not turn on or off -funroll-loops, and it > > > should not, so that it does

[Bug tree-optimization/107717] [13 Regression] ICEs expanding permutes after g:dc95e1e9702f2f6367bbc108c8d01169be1b66d2

2022-11-18 Thread wwwhhhyyy333 at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=107717 --- Comment #4 from Hongyu Wang --- (In reply to Tamar Christina from comment #3) > Fixed Thanks for the fix! It also give me a good tip for match pattern writing :)

[Bug middle-end/107734] [13 Regression] valgrind error for gcc/testsuite/cc.target/i386/pr46051.c

2022-11-18 Thread wwwhhhyyy333 at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=107734 --- Comment #12 from Hongyu Wang --- (In reply to Andrew Pinski from comment #9) > Fixed. Thanks for the fix! I was not aware that sbitmap does not have a default constructor :(.

[Bug target/107692] [13 regression] r13-3950-g071e428c24ee8c breaks many test cases

2022-11-17 Thread wwwhhhyyy333 at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=107692 --- Comment #6 from Hongyu Wang --- (In reply to Jiu Fu Guo from comment #4) > (In reply to Hongyu Wang from comment #2) > > Created attachment 53897 [details] > > A patch > > > > Sorry for introducing these fails. Here is the patch. > > > >

[Bug target/107692] [13 regression] r13-3950-g071e428c24ee8c breaks many test cases

2022-11-14 Thread wwwhhhyyy333 at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=107692 --- Comment #2 from Hongyu Wang --- Created attachment 53897 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=53897=edit A patch Sorry for introducing these fails. Here is the patch. I've tested the patch with cross-compler and all the

[Bug target/107676] Nonsensical docs for -mrelax-cmpxchg-loop

2022-11-14 Thread wwwhhhyyy333 at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=107676 --- Comment #6 from Hongyu Wang --- (In reply to Andrew Pinski from comment #5) > (In reply to Jonathan Wakely from comment #4) > > I don't think __atomic_compare_exchange emits such a loop. This is about > > __atomic_fetch_xor and friends,

[Bug target/107304] internal compiler error: in convert_move, at expr.cc:220 with -march=tigerlake

2022-10-18 Thread wwwhhhyyy333 at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=107304 --- Comment #10 from Hongyu Wang --- (In reply to H.J. Lu from comment #9) > (In reply to Hongtao.liu from comment #8) > > (In reply to H.J. Lu from comment #7) > > > (In reply to Hongtao.liu from comment #6) > > > > (In reply to Hongtao.liu

[Bug target/106180] [13 Regression] ICE in extract_insn, at recog.cc:2791 since r13-1418-g73f942c08deef3

2022-07-04 Thread wwwhhhyyy333 at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=106180 Hongyu Wang changed: What|Removed |Added CC||wwwhhhyyy333 at gmail dot com

[Bug target/105339] [x86] missing AVX-512F scalef functions when optimization is disabled

2022-04-27 Thread wwwhhhyyy333 at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=105339 --- Comment #7 from Hongyu Wang --- Fixed for gcc-9/10/11/12.

[Bug target/105288] AVX/AVX512 casts should use the "v" constraint

2022-04-15 Thread wwwhhhyyy333 at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=105288 --- Comment #1 from Hongyu Wang --- I think should be these 2? (define_insn_and_split "avx512f__" [(set (match_operand:AVX512MODE2P 0 "nonimmediate_operand" "=x,m") (vec_concat:AVX512MODE2P (vec_concat:

[Bug target/105034] [10/11/12 regression]Suboptimal codegen for min/max with -Os

2022-03-27 Thread wwwhhhyyy333 at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=105034 --- Comment #2 from Hongyu Wang --- For -O2 stv doesn't do such transform Computing gain for chain #1... Instruction gain 8 for 7: {r84:SI=smax(r85:SI,0);clobber flags:CC;} REG_DEAD r85:SI REG_UNUSED flags:CC Instruction

[Bug target/104978] [avx512fp16] wrong code for _mm_mask_fcmadd_round_sch

2022-03-21 Thread wwwhhhyyy333 at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=104978 --- Comment #5 from Hongyu Wang --- Fixed for GCC 12.

[Bug target/104977] [avx512fp16] wrong code for vfmaddcsh when -masm=intel.

2022-03-20 Thread wwwhhhyyy333 at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=104977 --- Comment #3 from Hongyu Wang --- Fixed for GCC 12.

[Bug target/104726] gcc.target/i386/pr104551.c FAILs

2022-03-01 Thread wwwhhhyyy333 at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=104726 --- Comment #7 from Hongyu Wang --- Fixed for GCC 12.

[Bug target/104724] gcc.target/i386/avx512fp16-vcvtsi2sh-1b.c etc. FAIL

2022-03-01 Thread wwwhhhyyy333 at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=104724 --- Comment #4 from Hongyu Wang --- Fixed for GCC 12.

[Bug target/104726] gcc.target/i386/pr104551.c FAILs

2022-03-01 Thread wwwhhhyyy333 at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=104726 Hongyu Wang changed: What|Removed |Added Attachment #52532|0 |1 is obsolete|

[Bug target/104726] gcc.target/i386/pr104551.c FAILs

2022-03-01 Thread wwwhhhyyy333 at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=104726 --- Comment #1 from Hongyu Wang --- Created attachment 52532 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=52532=edit A patch Hi Rainer, can you try this on your solaris system? We don't have such platform to confirm it works. I'll

[Bug target/104724] gcc.target/i386/avx512fp16-vcvtsi2sh-1b.c etc. FAIL

2022-03-01 Thread wwwhhhyyy333 at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=104724 --- Comment #1 from Hongyu Wang --- Created attachment 52531 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=52531=edit A patch Hi Rainer, can you try this on your solaris system? We don't have such platform to confirm it works. I'll

[Bug rtl-optimization/104664] [12 Regression] ICE: in extract_constrain_insn, at recog.cc:2670 (insn does not satisfy its constraints) with -Og -ffinite-math-only

2022-02-28 Thread wwwhhhyyy333 at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=104664 --- Comment #6 from Hongyu Wang --- Fixed for GCC 12.

[Bug rtl-optimization/104664] [12 Regression] ICE: in extract_constrain_insn, at recog.cc:2670 (insn does not satisfy its constraints) with -Og -ffinite-math-only

2022-02-27 Thread wwwhhhyyy333 at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=104664 --- Comment #4 from Hongyu Wang --- (In reply to Uroš Bizjak from comment #3) > Reconfirmed as RA issue. I'm afraid we'd avoid pattern like (insn 180 179 182 2 (set (reg:V8HF 220) (subreg:V8HF

[Bug target/104664] ICE: in extract_constrain_insn, at recog.cc:2670 (insn does not satisfy its constraints) with -Og -ffinite-math-only

2022-02-24 Thread wwwhhhyyy333 at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=104664 --- Comment #2 from Hongyu Wang --- starting from r12-6021

[Bug target/103069] cmpxchg isn't optimized

2022-02-22 Thread wwwhhhyyy333 at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=103069 --- Comment #19 from Hongyu Wang --- (In reply to Thiago Macieira from comment #18) > (In reply to Jakub Jelinek from comment #17) > > _Pragma("GCC target \"relax-cmpxchg-loop\"") > > should do that (ditto target("relax-cmpxchg-loop")

[Bug target/103069] cmpxchg isn't optimized

2022-02-22 Thread wwwhhhyyy333 at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=103069 --- Comment #15 from Hongyu Wang --- (In reply to Thiago Macieira from comment #14) > I'd restrict relaxations to loops emitted by the compiler. All other atomic > operations shouldn't be modified at all, unless the user asks for it. That >

[Bug target/103069] cmpxchg isn't optimized

2022-02-21 Thread wwwhhhyyy333 at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=103069 --- Comment #13 from Hongyu Wang --- All above glibc cases are now both relaxed by an load/cmp to skip cmpxchg under -mrelax-cmpxchg-loop, but for > do > { > flags = THREAD_GETMEM (self, cancelhandling); > newval =

[Bug target/103069] cmpxchg isn't optimized

2022-02-15 Thread wwwhhhyyy333 at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=103069 --- Comment #11 from Hongyu Wang --- For the case with atomic_compare_exchange_weak_release, it can be expanded as loop: mov%eax,%r8d and$0xfff8,%r8d mov(%r8),%rsi <--- load lock first cmp%rsi,%rax <---

[Bug target/103771] New: Missed vectorization under -mavx512f -mavx512vl after r12-5489

2021-12-20 Thread wwwhhhyyy333 at gmail dot com via Gcc-bugs
Priority: P3 Component: target Assignee: unassigned at gcc dot gnu.org Reporter: wwwhhhyyy333 at gmail dot com Target Milestone: --- cat vect.c typedef unsigned char uint8_t; static uint8_t x264_clip_uint8( int x

[Bug target/103571] ABI: V2HF, V4HF and V8HFmode argument passing issues

2021-12-06 Thread wwwhhhyyy333 at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=103571 Hongyu Wang changed: What|Removed |Added CC||wwwhhhyyy333 at gmail dot com

[Bug target/103066] __sync_val_compare_and_swap/__sync_bool_compare_and_swap aren't optimized

2021-11-04 Thread wwwhhhyyy333 at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=103066 --- Comment #1 from Hongyu Wang --- __sync_val_compare_and_swap will be expanded to atomic_compare_exchange_strong by default, should we restrict the check and return under atomic_compare_exchange_weak which is allowed to fail spuriously?

[Bug target/102812] Unoptimal (and wrong) code for _Float16 insert

2021-10-20 Thread wwwhhhyyy333 at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=102812 --- Comment #3 from Hongyu Wang --- (In reply to Uroš Bizjak from comment #2) > Please note that the code above should compile via ix86_expand_vector_set, > similar to: > > --cut here-- > typedef short v8hi

[Bug target/102835] gcc.target/i386/avx512fp16-trunchf.c FAILs

2021-10-19 Thread wwwhhhyyy333 at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=102835 --- Comment #1 from Hongyu Wang --- (In reply to Rainer Orth from comment #0) > > I wonder what's the best way to handle the difference? Just add > -fomit-frame-pointer > to the testcase or allow for the %ebp vs. %esp difference? For this

[Bug target/102806] New: [x86] Suboptimal codegen for v4hi vector concat under -mavx512bw and -mavx512vl

2021-10-18 Thread wwwhhhyyy333 at gmail dot com via Gcc-bugs
Severity: normal Priority: P3 Component: target Assignee: unassigned at gcc dot gnu.org Reporter: wwwhhhyyy333 at gmail dot com Target Milestone: --- For typedef short v8hi __attribute__((vector_size (16))); typedef short v4hi __attribute__((vector_size (8

[Bug tree-optimization/101993] Potential vectorization opportunity when condition checks array address

2021-08-20 Thread wwwhhhyyy333 at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=101993 --- Comment #2 from Hongyu Wang --- (In reply to Richard Biener from comment #1) > We can vectorize this with masked moves when using AVX2. clang seems to > simply remove the test completely - C seems to guarantee that a + i is a > valid

[Bug tree-optimization/101993] New: Potential vectorization opportunity when condition checks array address

2021-08-19 Thread wwwhhhyyy333 at gmail dot com via Gcc-bugs
: normal Priority: P3 Component: tree-optimization Assignee: unassigned at gcc dot gnu.org Reporter: wwwhhhyyy333 at gmail dot com Target Milestone: --- For float foo(int * restrict a, int * restrict res, int n) { int i; for (i = 0; i < 8

[Bug target/101395] [11/12 regression] Compile failure with -march=native -m32 on sapphirerapids

2021-07-12 Thread wwwhhhyyy333 at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=101395 --- Comment #10 from Hongyu Wang --- (In reply to H.J. Lu from comment #9) > Created attachment 51143 [details] > A patch > > Try this instead. This also works.

[Bug target/101395] [11/12 regression] Compile failure with -march=native -m32 on sapphirerapids

2021-07-10 Thread wwwhhhyyy333 at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=101395 --- Comment #4 from Hongyu Wang --- (In reply to H.J. Lu from comment #3) > Created attachment 51125 [details] > An updated patch This works, thanks.

[Bug target/101395] [11/12 regression] Compile failure with -march=native -m32 on sapphirerapids

2021-07-09 Thread wwwhhhyyy333 at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=101395 --- Comment #2 from Hongyu Wang --- (In reply to H.J. Lu from comment #1) > Created attachment 51124 [details] > A patch > > Please test this patch. It doesn't work. I use ./sde-external-8.63.0-2021-01-18-lin/sde -spr -- gcc test.c

[Bug target/101395] New: Compile failure with -march=native -m32 on sapphirerapids

2021-07-09 Thread wwwhhhyyy333 at gmail dot com via Gcc-bugs
Priority: P3 Component: target Assignee: unassigned at gcc dot gnu.org Reporter: wwwhhhyyy333 at gmail dot com Target Milestone: --- cat test.c int main() { return 0; } On sapphire rapids machine, gcc test.c -march=native -m32 will get cc1: error: ‘-muintr

[Bug tree-optimization/98176] Loop invariant memory could not be hoisted when nonpure_call in loop body

2021-07-07 Thread wwwhhhyyy333 at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=98176 --- Comment #9 from Hongyu Wang --- (In reply to Richard Biener from comment #8) > I'm failing to reproduce with the sincos example since sincos is transformed > to __builtin_cexpi for me. When using I always generate sincosf with g++ -Ofast

[Bug target/101276] [i386] Keylocker output should be cleared when instruction reports runtime error.

2021-07-02 Thread wwwhhhyyy333 at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=101276 Hongyu Wang changed: What|Removed |Added Status|UNCONFIRMED |RESOLVED Resolution|---

[Bug target/101276] New: [i386] Keylocker output should be cleared when instruction reports runtime error.

2021-06-30 Thread wwwhhhyyy333 at gmail dot com via Gcc-bugs
Severity: normal Priority: P3 Component: target Assignee: unassigned at gcc dot gnu.org Reporter: wwwhhhyyy333 at gmail dot com Target Milestone: --- Some keylocker instruction will set ZF when runtime occurs, and the output data should be invalid. Current

[Bug tree-optimization/98339] New: GCC could not vectorize loop with conditional reduced add and store

2020-12-16 Thread wwwhhhyyy333 at gmail dot com via Gcc-bugs
-optimization Severity: normal Priority: P3 Component: tree-optimization Assignee: unassigned at gcc dot gnu.org Reporter: wwwhhhyyy333 at gmail dot com Target Milestone: --- For testcase void foo( int* restrict x

[Bug tree-optimization/98176] Loop invariant memory could not be hoisted when nonpure_call in loop body

2020-12-15 Thread wwwhhhyyy333 at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=98176 --- Comment #7 from Hongyu Wang --- (In reply to Richard Biener from comment #5) > Yes. > > For a LIM testcase an example with a memcpy might be more practically > relevant. > > For refactoring I'd start with classifying the unanalyzable refs

[Bug tree-optimization/98176] Loop invariant memory could not be hoisted when nonpure_call in loop body

2020-12-08 Thread wwwhhhyyy333 at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=98176 --- Comment #6 from Hongyu Wang --- (In reply to Richard Biener from comment #5) > (In reply to Hongyu Wang from comment #4) > > (In reply to Richard Biener from comment #3) > > > > > I see ret[0] has store-motion applied. You don't see it

[Bug tree-optimization/98176] Loop invariant memory could not be hoisted when nonpure_call in loop body

2020-12-08 Thread wwwhhhyyy333 at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=98176 --- Comment #4 from Hongyu Wang --- (In reply to Richard Biener from comment #3) > I see ret[0] has store-motion applied. You don't see it vectorized > because GCC doesn't know how to vectorize sincos (or cexpi which is > what it lowers it

[Bug tree-optimization/98176] Loop invariant memory could not be hoisted when nonpure_call in loop body

2020-12-07 Thread wwwhhhyyy333 at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=98176 --- Comment #2 from Hongyu Wang --- >> I doubt the call is the issue btw. The aliasing could be removed by float foo(int *x, int n, float tx) { float ret[n]; #pragma omp simd for (int i = 0; i < n; i++) {

[Bug tree-optimization/98176] New: Loop invariant memory could not be hoisted when nonpure_call in loop body

2020-12-07 Thread wwwhhhyyy333 at gmail dot com via Gcc-bugs
: normal Priority: P3 Component: tree-optimization Assignee: unassigned at gcc dot gnu.org Reporter: wwwhhhyyy333 at gmail dot com Target Milestone: --- For testcase #include void foo(float *x, float tx, float *ret, int n) { #pragma omp simd for (int i = 0

[Bug target/97231] Missing FSF copyright notes for some x86 intrinsic headers

2020-09-28 Thread wwwhhhyyy333 at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=97231 --- Comment #1 from Hongyu Wang --- Created attachment 49280 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=49280=edit A patch

[Bug target/97231] New: Missing FSF copyright notes for some x86 intrinsic headers

2020-09-28 Thread wwwhhhyyy333 at gmail dot com via Gcc-bugs
Priority: P3 Component: target Assignee: unassigned at gcc dot gnu.org Reporter: wwwhhhyyy333 at gmail dot com Target Milestone: --- Many x86 intrinsic header files doesn't have FSF copyright: amxbf16intrin.h amxint8intrin.h amxtileintrin.h avx512vp2intersectintrin.h

[Bug target/96186] New: [11 regressoion] ICE: Unrecognizable insn since r11-1970-fab263ab0fc10ea08409b80afa7e8569438b8d28

2020-07-13 Thread wwwhhhyyy333 at gmail dot com
Severity: normal Priority: P3 Component: target Assignee: unassigned at gcc dot gnu.org Reporter: wwwhhhyyy333 at gmail dot com Target Milestone: --- A testcase extracted from 510.parest_r #include double square(double d[3], double rad

[Bug tree-optimization/92980] [miss optimization]redundant load missed by fre.

2020-01-16 Thread wwwhhhyyy333 at gmail dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92980 --- Comment #11 from Hongyu Wang --- (In reply to rguent...@suse.de from comment #10) > > It has two exits which makes it difficult > Or impossible to make it truly do-while. > But it's close enough and further rotating the loop doesn't make it

[Bug tree-optimization/92980] [miss optimization]redundant load missed by fre.

2020-01-15 Thread wwwhhhyyy333 at gmail dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92980 --- Comment #9 from Hongyu Wang --- (In reply to Hongtao.liu from comment #6) > New fail by removal > unix/-m32: gcc.dg/tree-ssa/copy-headers-5.c scan-tree-dump ch2 "is now > do-while loop" > unix/-m32: gcc.dg/tree-ssa/copy-headers-5.c

[Bug target/92651] [10 Regression] Unnecessary stv transform in some x86 backend

2019-12-16 Thread wwwhhhyyy333 at gmail dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92651 Hongyu Wang changed: What|Removed |Added Status|UNCONFIRMED |RESOLVED Resolution|---

[Bug bootstrap/92927] New: Bootstrap failed with CXXFLAGS="-O0"

2019-12-12 Thread wwwhhhyyy333 at gmail dot com
bootstrap Assignee: unassigned at gcc dot gnu.org Reporter: wwwhhhyyy333 at gmail dot com Target Milestone: --- configure GCC with: --enable-languages=c,c++,fortran --enable-bootstrap CXXFLAGS="-O0" At stage3, it throws error: /usr/bin/ld: /home/hongyuw1/gcc/build_

[Bug target/92651] [10 Regression] Unnecessary stv transform in some x86 backend

2019-11-26 Thread wwwhhhyyy333 at gmail dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92651 --- Comment #7 from Hongyu Wang --- (In reply to rguent...@suse.de from comment #6) > On Tue, 26 Nov 2019, wwwhhhyyy333 at gmail dot com wrote: > > > https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92651 > > > > --- Com

[Bug target/92651] [10 Regression] Unnecessary stv transform in some x86 backend

2019-11-25 Thread wwwhhhyyy333 at gmail dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92651 --- Comment #5 from Hongyu Wang --- (In reply to Jakub Jelinek from comment #3) > Do you mean r274481 rather than r277481, right? Yes. Thanks for your correction.

[Bug target/92651] [10 Regression] Unnecessary stv transform in some x86 backend

2019-11-25 Thread wwwhhhyyy333 at gmail dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92651 --- Comment #4 from Hongyu Wang --- (In reply to Richard Biener from comment #2) > Btw, which variant is actually the fastest for you? abs expansion doesn't > do any cost comparison but just uses direct abs, max and then the xor with > shift

[Bug target/92651] New: [10 Regression] Unnecessary stv transform in some x86 backend

2019-11-24 Thread wwwhhhyyy333 at gmail dot com
Priority: P3 Component: target Assignee: unassigned at gcc dot gnu.org Reporter: wwwhhhyyy333 at gmail dot com CC: rguenther at suse dot de Target Milestone: --- For test case #include int foo(unsigned char a, unsigned char b) { int isum=abs