from:"Yuliang Wang"

[match.pd] Mid-end fix for r277110

2019-10-18 Thread Yuliang Wang

; Built and tested on aarch64-none-elf. Regards, Yuliang Wang gcc/ChangeLog: 2019-10-17 Yuliang Wang * match.pd (/* (x & ~m) | (y & m) -> ... */): Modified fold pattern. * genmatch.c (convert3): New convert operation to support the above. gcc/

[AArch64][SVE2] Fix for r277110 (BSL variants)

2019-10-17 Thread Yuliang Wang

; Built and tested on aarch64-none-elf. Regards, Yuliang Wang gcc/ChangeLog: 2019-10-17 Yuliang Wang * match.pd (/* (x & ~m) | (y & m) -> ... */): Modified fold pattern. * genmatch.c (convert3): New convert operation to support the above. gcc/

RE: [AArch64][SVE2] Support for EOR3 and variants of BSL

2019-10-17 Thread Yuliang Wang

Thanks very much, updated. Regards, Yuliang gcc/ChangeLog: 2019-10-17 Yuliang Wang * config/aarch64/aarch64-sve2.md (aarch64_sve2_eor3) (aarch64_sve2_nor, aarch64_sve2_nand) (aarch64_sve2_bsl, aarch64_sve2_nbsl) (aarch64_sve2_bsl1n, aarch64_sve2_bsl2n

RE: [AArch64][SVE2] Support for EOR3 and variants of BSL

2019-10-17 Thread Yuliang Wang

Hi Richard, Thanks for the suggestions, updated. Regards, Yuliang gcc/ChangeLog: 2019-10-17 Yuliang Wang * config/aarch64/aarch64-sve2.md (aarch64_sve2_eor3) (aarch64_sve2_nor, aarch64_sve2_nand) (aarch64_sve2_bsl, aarch64_sve2_nbsl) (aarch64_sve2_bsl1n

[AArch64][SVE2] Support for EOR3 and variants of BSL

2019-10-16 Thread Yuliang Wang

y -> NBSL x, y, x NAND x, y -> NBSL x, y, y Built and tested on aarch64-none-elf. Best Regards, Yuliang Wang gcc/ChangeLog: 2019-10-16 Yuliang Wang * config/aarch64/aarch64-sve2.md (aarch64_sve2_eor3) (aarch64_sve2_nor, aarch64_sve2_nand) (aarch64_

RE: [AArch64][SVE2] Shift-Right Accumulate combine patterns

2019-10-04 Thread Yuliang Wang

Hi Christophe, Thanks for pointing this out, uploaded a fix. Regards, Yuliang From: Christophe Lyon Sent: 04 October 2019 09:25 To: Richard Sandiford Cc: Yuliang Wang ; gcc-patches@gcc.gnu.org; nd Subject: Re: [AArch64][SVE2] Shift-Right Accumulate combine patterns On Fri, 27 Sep 2019

[AArch64][SVE2] Fix for new test in r276174

2019-10-04 Thread Yuliang Wang

Hi, The new test added as part of r276174 fails on an ilp32 target, which this patch fixes. Thanks to Christophe Lyon for pointing this out. Regression tested on aarch64-none-elf. Best Regards, Yuliang Wang gcc/testsuite/ChangeLog: 2019-10-04 Yuliang Wang * gcc.target/aarch64

RE: [AArch64][SVE] Utilize ASRD instruction for division and remainder

2019-09-27 Thread Yuliang Wang

Thanks for the corrections, updated. Regards Yuliang (no ChangeLog updates) -Original Message- From: Richard Sandiford Sent: 27 September 2019 11:20 To: Yuliang Wang Cc: gcc-patches@gcc.gnu.org; nd Subject: Re: [AArch64][SVE] Utilize ASRD instruction for division and remainder

RE: [AArch64][SVE] Utilize ASRD instruction for division and remainder

2019-09-27 Thread Yuliang Wang

Apologies for the accidental change, and added the underscore. Regards Yuliang gcc/ChangeLog: 2019-09-27 Yuliang Wang * config/aarch64/aarch64-sve.md (sdiv_pow23): New pattern for ASRD. * config/aarch64/iterators.md (UNSPEC_ASRD): New unspec. * internal

RE: [AArch64][SVE] Utilize ASRD instruction for division and remainder

2019-09-27 Thread Yuliang Wang

Hi Richard, I have renamed the optabs and associated identifiers as per your suggestion. Thanks. Regards Yuliang gcc/ChangeLog: 2019-09-27 Yuliang Wang * config/aarch64/aarch64-sve.md (sdiv_pow23): New pattern for ASRD. * config/aarch64/iterators.md (UNSPEC_ASRD

[AArch64][SVE2] Shift-Right Accumulate combine patterns

2019-09-26 Thread Yuliang Wang

z0.s, z0.s, z1.s ... after ssraz0.s, z1.s, #{IMM} Unsigned: beforelsr z0.s, z0.s, #{IMM} add z0.s, z0.s, z1.s ... after usraz0.s, z1.s, #{IMM} Built and regression tested on aarch64-none-elf. Best Regards, Yuliang Wang gcc/

[AArch64][SVE] Utilize ASRD instruction for division and remainder

2019-09-24 Thread Yuliang Wang

ret Added new tests. Built and regression tested on aarch64-none-elf. Best Regards, Yuliang Wang gcc/ChangeLog: 2019-09-23 Yuliang Wang * config/aarch64/aarch64-sve.md (asrd3): New pattern for ASRD. * config/aarch64/iterators.md (UNSPEC_ASRD): New unspec. (ASRDIV)

RE: [PATCH] Reduction of conditional operations for vectorization

2019-09-20 Thread Yuliang Wang

return NULL_TREE; ... For instance, this causes infinite recursion in gcc.dg/vect/fast-math-vect-call-2 because ARG is a float literal. Regards, Yuliang -Original Message- From: Richard Biener Sent: 20 September 2019 13:02 To: Yuliang Wang Cc: gcc-patches@gcc.gnu.org; nd ; Richard S

[PATCH] Reduction of conditional operations for vectorization

2019-09-20 Thread Yuliang Wang

, Yuliang Wang gcc/ChangeLog: 2019-09-19 Yuliang Wang * match.pd (for cnd (cond vec_cond)): New match statements for the above patterns. * doc/sourcebuild.texi (vect_condred_si): Document new target selector. gcc/testsuite/ChangeLog: 2019-09-19 Yuliang Wang

RE: [PATCH][AArch64] Vectorize MULH(R)S patterns with SVE2 instructions

2019-09-12 Thread Yuliang Wang

Hi Richard, Thanks for your comments and advice; I have applied the relevant changes. Regards, Yuliang UPDATE: Added new tests. Built and regression tested on aarch64-none-elf and aarch64-linux-gnu. gcc/ChangeLog: 2019-09-1 Yuliang Wang PR tree-optimization/89386

[PATCH][AArch64] Vectorize MULH(R)S patterns with SVE2 instructions

2019-08-29 Thread Yuliang Wang

} ... vectorizes with SHRNT/SHRNB * 32-bit and 8-bit input/output types * Signed output types SMULLT/SMULLB are generated instead SQRDMULH was considered as a potential single-instruction optimization but saturates the intermediate value instead of truncating. Best Regards, Yuliang Wang

[match.pd] Mid-end fix for r277110

[AArch64][SVE2] Fix for r277110 (BSL variants)

RE: [AArch64][SVE2] Support for EOR3 and variants of BSL

RE: [AArch64][SVE2] Support for EOR3 and variants of BSL

[AArch64][SVE2] Support for EOR3 and variants of BSL

RE: [AArch64][SVE2] Shift-Right Accumulate combine patterns

[AArch64][SVE2] Fix for new test in r276174

RE: [AArch64][SVE] Utilize ASRD instruction for division and remainder

RE: [AArch64][SVE] Utilize ASRD instruction for division and remainder

RE: [AArch64][SVE] Utilize ASRD instruction for division and remainder

[AArch64][SVE2] Shift-Right Accumulate combine patterns

[AArch64][SVE] Utilize ASRD instruction for division and remainder

RE: [PATCH] Reduction of conditional operations for vectorization

[PATCH] Reduction of conditional operations for vectorization

RE: [PATCH][AArch64] Vectorize MULH(R)S patterns with SVE2 instructions

[PATCH][AArch64] Vectorize MULH(R)S patterns with SVE2 instructions

16 matches

Site Navigation

Mail list logo

Footer information