RE: [PATCH] Support -m[no-]gather -m[no-]scatter to enable/disable vectorization for all gather/scatter instructions.

2023-08-09 Thread Liu, Hongtao via Gcc-patches
> -Original Message- > From: Xi Ruoyao > Sent: Thursday, August 10, 2023 9:48 AM > To: Liu, Hongtao ; gcc-patches@gcc.gnu.org > Cc: richard.guent...@gmail.com; ubiz...@gmail.com; hubi...@ucw.cz > Subject: Re: [PATCH] Support -m[no-]gather -m[no-]scatter to enable/disable > vectorization

RE: [PATCH V2] [X86] Workaround possible CPUID bug in Sandy Bridge.

2023-08-09 Thread Liu, Hongtao via Gcc-patches
> -Original Message- > From: Uros Bizjak > Sent: Wednesday, August 9, 2023 2:33 PM > To: Liu, Hongtao > Cc: gcc-patches@gcc.gnu.org > Subject: Re: [PATCH V2] [X86] Workaround possible CPUID bug in Sandy > Bridge. > > On Wed, Aug 9, 2023 at 3:48 AM liuhongt wrote: > > > > > Please

RE: [PATCH] x86: fold two of vec_dupv2df's alternatives

2023-08-01 Thread Liu, Hongtao via Gcc-patches
> -Original Message- > From: Jan Beulich > Sent: Tuesday, August 1, 2023 1:49 PM > To: gcc-patches@gcc.gnu.org > Cc: Liu, Hongtao ; Kirill Yukhin > > Subject: [PATCH] x86: fold two of vec_dupv2df's alternatives > > By using Yvm in the source, both can be expressed in one. > > gcc/ >

RE: [PATCH] Replace invariant ternlog operands

2023-07-26 Thread Liu, Hongtao via Gcc-patches
> -Original Message- > From: Yan Simonaytes > Sent: Wednesday, July 26, 2023 2:11 AM > To: gcc-patches@gcc.gnu.org > Cc: Liu, Hongtao ; Uros Bizjak ; > Yan Simonaytes > Subject: [PATCH] Replace invariant ternlog operands > > Sometimes GCC generates ternlog with three operands, but

RE: [PATCH] Initial Granite Rapids D Support

2023-07-11 Thread Liu, Hongtao via Gcc-patches
> -Original Message- > From: Mo, Zewei > Sent: Wednesday, July 12, 2023 1:56 PM > To: gcc-patches@gcc.gnu.org > Cc: Liu, Hongtao ; ubiz...@gmail.com > Subject: [PATCH] Initial Granite Rapids D Support > > Hi all, > > This patch is to add initial support for Granite Rapids D for GCC.

RE: [PATCH] x86: improve fast bfloat->float conversion

2023-07-11 Thread Liu, Hongtao via Gcc-patches
> -Original Message- > From: Jan Beulich > Sent: Tuesday, July 11, 2023 3:50 PM > To: Liu, Hongtao > Cc: Kirill Yukhin ; gcc-patches@gcc.gnu.org > Subject: Re: [PATCH] x86: improve fast bfloat->float conversion > > On 11.07.2023 08:45, Liu, Hongtao wrote: > >> -Original

RE: [PATCH] x86: improve fast bfloat->float conversion

2023-07-11 Thread Liu, Hongtao via Gcc-patches
> -Original Message- > From: Jan Beulich > Sent: Tuesday, July 11, 2023 2:08 PM > To: gcc-patches@gcc.gnu.org > Cc: Liu, Hongtao ; Kirill Yukhin > > Subject: [PATCH] x86: improve fast bfloat->float conversion > > There's nothing AVX512BW-ish in here, so no reason to use Yw as the >

RE: [PATCH v3] x86: make better use of VBROADCASTSS / VPBROADCASTD

2023-07-11 Thread Liu, Hongtao via Gcc-patches
> -Original Message- > From: Jan Beulich > Sent: Tuesday, July 11, 2023 2:04 PM > To: gcc-patches@gcc.gnu.org > Cc: Kirill Yukhin ; Liu, Hongtao > > Subject: [PATCH v3] x86: make better use of VBROADCASTSS / > VPBROADCASTD > > ... in vec_dupv4sf / *vec_dupv4si. The respective

RE: [PATCH] Initial Granite Rapids D Support

2023-07-06 Thread Liu, Hongtao via Gcc-patches
> -Original Message- > From: Mo, Zewei > Sent: Thursday, July 6, 2023 2:37 PM > To: gcc-patches@gcc.gnu.org > Cc: Liu, Hongtao ; ubiz...@gmail.com > Subject: [PATCH] Initial Granite Rapids D Support > > Hi all, > > This patch is to add initial support for Granite Rapids D for GCC. >

RE: [PATCH v3] x86: make VPTERNLOG* usable on less than 512-bit operands with just AVX512F

2023-07-04 Thread Liu, Hongtao via Gcc-patches
> -Original Message- > From: Jan Beulich > Sent: Tuesday, July 4, 2023 11:30 PM > To: Hongtao Liu > Cc: gcc-patches@gcc.gnu.org; Kirill Yukhin ; Liu, > Hongtao > Subject: Re: [PATCH v3] x86: make VPTERNLOG* usable on less than 512-bit > operands with just AVX512F > > On 27.06.2023

RE: [PATCH v2] x86: make better use of VBROADCASTSS / VPBROADCASTD

2023-06-24 Thread Liu, Hongtao via Gcc-patches
> -Original Message- > From: Jan Beulich > Sent: Wednesday, June 21, 2023 8:40 PM > To: Hongtao Liu > Cc: gcc-patches@gcc.gnu.org; Kirill Yukhin ; Liu, > Hongtao > Subject: Re: [PATCH v2] x86: make better use of VBROADCASTSS / > VPBROADCASTD > > On 21.06.2023 09:44, Jan Beulich

RE: [PATCH v2] x86: make VPTERNLOG* usable on less than 512-bit operands with just AVX512F

2023-06-18 Thread Liu, Hongtao via Gcc-patches
> -Original Message- > From: Jan Beulich > Sent: Friday, June 16, 2023 2:22 PM > To: gcc-patches@gcc.gnu.org > Cc: Kirill Yukhin ; Liu, Hongtao > > Subject: [PATCH v2] x86: make VPTERNLOG* usable on less than 512-bit > operands with just AVX512F > > There's no reason to constrain this

RE: [PATCH v2] x86: correct and improve "*vec_dupv2di"

2023-06-18 Thread Liu, Hongtao via Gcc-patches
> -Original Message- > From: Jan Beulich > Sent: Friday, June 16, 2023 2:20 PM > To: gcc-patches@gcc.gnu.org > Cc: Liu, Hongtao ; Kirill Yukhin > > Subject: [PATCH v2] x86: correct and improve "*vec_dupv2di" > > The input constraint for the %vmovddup alternative was wrong, as the

RE: [PATCH 3/3] AVX512 fully masked vectorization

2023-06-14 Thread Liu, Hongtao via Gcc-patches
> -Original Message- > From: Richard Biener > Sent: Wednesday, June 14, 2023 10:30 PM > To: Andrew Stubbs > Cc: gcc-patches@gcc.gnu.org; richard.sandif...@arm.com; Jan Hubicka > ; Liu, Hongtao ; > kirill.yuk...@gmail.com > Subject: Re: [PATCH 3/3] AVX512 fully masked vectorization > >

RE: [PATCH] i386: Fix incorrect intrinsic signature for AVX512 s{lli|rai|rli}

2023-05-25 Thread Liu, Hongtao via Gcc-patches
> -Original Message- > From: Hu, Lin1 > Sent: Thursday, May 25, 2023 3:52 PM > To: Hongtao Liu > Cc: gcc-patches@gcc.gnu.org; Liu, Hongtao ; > ubiz...@gmail.com > Subject: RE: [PATCH] i386: Fix incorrect intrinsic signature for AVX512 > s{lli|rai|rli} > > OK, I update the change log

RE: [PATCH 1/2] Use NO_REGS in cost calculation when the preferred register class are not known yet.

2023-04-22 Thread Liu, Hongtao via Gcc-patches
> -Original Message- > From: Vladimir Makarov > Sent: Saturday, April 22, 2023 3:26 AM > To: Liu, Hongtao ; gcc-patches@gcc.gnu.org > Cc: crazy...@gmail.com; hjl.to...@gmail.com > Subject: Re: [PATCH 1/2] Use NO_REGS in cost calculation when the > preferred register class are not known

RE: [PATCH] i386: Share AES xmm intrin with VAES

2023-04-18 Thread Liu, Hongtao via Gcc-patches
> -Original Message- > From: Jiang, Haochen > Sent: Wednesday, April 19, 2023 10:41 AM > To: Hongtao Liu > Cc: gcc-patches@gcc.gnu.org; Liu, Hongtao ; > ubiz...@gmail.com > Subject: RE: [PATCH] i386: Share AES xmm intrin with VAES > > > > a/gcc/config/i386/sse.md

RE: [PATCH] Re-arrange sections of i386 cpuid

2023-04-18 Thread Liu, Hongtao via Gcc-patches
> -Original Message- > From: Mo, Zewei > Sent: Wednesday, April 19, 2023 10:03 AM > To: gcc-patches@gcc.gnu.org > Cc: Liu, Hongtao ; ubiz...@gmail.com > Subject: [PATCH] Re-arrange sections of i386 cpuid > > Re-order i386 cpuid based on the order of CPUID. > > gcc/ChangeLog: > >

RE: [PATCH] Check hard_regno_mode_ok before setting lowest memory move cost for the mode with different reg classes.

2023-04-05 Thread Liu, Hongtao via Gcc-patches
> -Original Message- > From: Vladimir Makarov > Sent: Wednesday, April 5, 2023 8:59 PM > To: Jeff Law ; Liu, Hongtao > ; gcc-patches@gcc.gnu.org > Subject: Re: [PATCH] Check hard_regno_mode_ok before setting lowest > memory move cost for the mode with different reg classes. > > > On

RE: [PATCH] i386: Fix up -Wuninitialized warnings in avx512erintrin.h [PR105593]

2023-01-31 Thread Liu, Hongtao via Gcc-patches
> -Original Message- > From: Jakub Jelinek > Sent: Tuesday, January 31, 2023 4:09 PM > To: Liu, Hongtao ; Uros Bizjak > Cc: gcc-patches@gcc.gnu.org > Subject: [PATCH] i386: Fix up -Wuninitialized warnings in avx512erintrin.h > [PR105593] > > Hi! > > As reported in the PR, there are

RE: [PATCH 2/4] Initial Emeraldrapids Support

2023-01-03 Thread Liu, Hongtao via Gcc-patches
There are actually only two patches, not four, and the subject *Patch 2/4* should be a typo. > -Original Message- > From: Hu, Lin1 > Sent: Tuesday, January 3, 2023 4:37 PM > To: gcc-patches@gcc.gnu.org > Cc: Liu, Hongtao ; ubiz...@gmail.com > Subject: [PATCH 2/4] Initial Emeraldrapids

RE: [PATCH] [x86] x86: Don't add crtfastmath.o for -shared and add a new option -mdaz-ftz to enable FTZ and DAZ flags in MXCSR.

2022-12-14 Thread Liu, Hongtao via Gcc-patches
> -Original Message- > From: Richard Biener > Sent: Wednesday, December 14, 2022 4:23 PM > To: Jakub Jelinek > Cc: Liu, Hongtao ; gcc-patches@gcc.gnu.org; > crazy...@gmail.com; hjl.to...@gmail.com; ubiz...@gmail.com > Subject: Re: [PATCH] [x86] x86: Don't add crtfastmath.o for -shared

RE: [PATCH] i386: Only enable small loop unrolling in backend [PR 107602]

2022-11-20 Thread Liu, Hongtao via Gcc-patches
> -Original Message- > From: Wang, Hongyu > Sent: Saturday, November 19, 2022 2:26 PM > To: gcc-patches@gcc.gnu.org > Cc: richard.guent...@gmail.com; ubiz...@gmail.com; Liu, Hongtao > > Subject: [PATCH] i386: Only enable small loop unrolling in backend [PR 107602] > > Hi, > >

RE: [PATCH 4/6] Support Intel AVX-NE-CONVERT

2022-10-30 Thread Liu, Hongtao via Gcc-patches
> -Original Message- > From: Kong, Lingling > Sent: Friday, October 28, 2022 4:57 PM > To: Hongtao Liu > Cc: Liu, Hongtao ; gcc-patches@gcc.gnu.org; Jiang, > Haochen > Subject: RE: [PATCH 4/6] Support Intel AVX-NE-CONVERT > > Hi, > > Because we switch intrinsics for avx512bf16 to

RE: [PATCH] MAINTAINERS: Add myself for write after approval

2022-10-12 Thread Liu, Hongtao via Gcc-patches
> -Original Message- > From: Cui, Lili > Sent: Wednesday, October 12, 2022 3:50 PM > To: gcc-patches@gcc.gnu.org > Cc: Liu, Hongtao > Subject: [PATCH] MAINTAINERS: Add myself for write after approval > > Hi, > > I want to add myself in MAINTANINER for write after approval. > > OK

RE: [PATCH] Remove AVX512_VP2INTERSECT from PTA_SAPPHIRERAPIDS

2022-10-11 Thread Liu, Hongtao via Gcc-patches
> -Original Message- > From: Cui, Lili > Sent: Wednesday, October 12, 2022 11:00 AM > To: gcc-patches@gcc.gnu.org > Cc: Liu, Hongtao ; ubiz...@gmail.com; Lu, Hongjiu > > Subject: [PATCH] Remove AVX512_VP2INTERSECT from PTA_SAPPHIRERAPIDS > > Hi Hontao, > > This patch is to remove

RE: [PATCH] [x86] Add define_insn_and_split to support general version of "kxnor".

2022-10-11 Thread Liu, Hongtao via Gcc-patches
> -Original Message- > From: Jakub Jelinek > Sent: Tuesday, October 11, 2022 9:59 PM > To: Liu, Hongtao > Cc: gcc-patches@gcc.gnu.org > Subject: Re: [PATCH] [x86] Add define_insn_and_split to support general > version of "kxnor". > > On Tue, Oct 11, 2022 at 04:03:16PM +0800, liuhongt

RE: [PATCH] testsuite: Fix up avx256-unaligned-store-3.c test.

2022-09-25 Thread Liu, Hongtao via Gcc-patches
> -Original Message- > From: Hu, Lin1 > Sent: Monday, September 26, 2022 1:20 PM > To: gcc-patches@gcc.gnu.org > Cc: Liu, Hongtao ; ubiz...@gmail.com > Subject: [PATCH] testsuite: Fix up avx256-unaligned-store-3.c test. > > Hi all, > > This patch aims to fix a problem that

RE: [PATCH] i386: Add syscall to enable AMX for latest kernels

2022-09-22 Thread Liu, Hongtao via Gcc-patches
> -Original Message- > From: Jiang, Haochen > Sent: Thursday, September 22, 2022 2:23 PM > To: Uros Bizjak > Cc: gcc-patches@gcc.gnu.org; Liu, Hongtao > Subject: RE: [PATCH] i386: Add syscall to enable AMX for latest kernels > > Hi all, > > I would like to backport this patch to GCC

RE: [PATCH] i386: Fixed vec_init_dup_v16bf [PR106887]

2022-09-16 Thread Liu, Hongtao via Gcc-patches
> -Original Message- > From: Kong, Lingling > Sent: Friday, September 16, 2022 3:40 PM > To: Hongtao Liu > Cc: gcc-patches@gcc.gnu.org; Liu, Hongtao > Subject: RE: [PATCH] i386: Fixed vec_init_dup_v16bf [PR106887] > > Hi, > > > > machine_mode hvmode = (mode == V16HImode ?

RE: [PATCH] i386: Extend cvtps2pd to memory

2022-06-30 Thread Liu, Hongtao via Gcc-patches
> -Original Message- > From: Uros Bizjak > Sent: Thursday, June 30, 2022 4:53 PM > To: Jiang, Haochen > Cc: gcc-patches@gcc.gnu.org; Liu, Hongtao > Subject: Re: [PATCH] i386: Extend cvtps2pd to memory > > On Thu, Jun 30, 2022 at 10:45 AM Uros Bizjak wrote: > > > > On Thu, Jun 30,

RE: [PATCH] i386: Add AVX512BW to AVX512F in MASK_ISA2

2022-06-29 Thread Liu, Hongtao via Gcc-patches
> -Original Message- > From: Jiang, Haochen > Sent: Thursday, June 30, 2022 9:51 AM > To: gcc-patches@gcc.gnu.org > Cc: ubiz...@gmail.com; Liu, Hongtao > Subject: [PATCH] i386: Add AVX512BW to AVX512F in MASK_ISA2 > > Hi all, > > I just found in MASK_ISA2_UNSET part, since AVX512BW

RE: [PATCH] Add a bit dislike for separate mem alternative when op is REG_P.

2022-05-29 Thread Liu, Hongtao via Gcc-patches
> -Original Message- > From: Alexander Monakov > Sent: Friday, May 27, 2022 5:39 PM > To: Liu, Hongtao > Cc: gcc-patches@gcc.gnu.org > Subject: Re: [PATCH] Add a bit dislike for separate mem alternative when op is > REG_P. > > On Wed, 25 May 2022, liuhongt via Gcc-patches wrote: > >

RE: [PATCH] Optimize vpermtiw/b to vpunpcklqdq for certain cases.

2022-05-13 Thread Liu, Hongtao via Gcc-patches
> -Original Message- > From: Uros Bizjak > Sent: Friday, May 13, 2022 4:15 PM > To: Liu, Hongtao > Cc: gcc-patches@gcc.gnu.org > Subject: Re: [PATCH] Optimize vpermtiw/b to vpunpcklqdq for certain cases. > > On Fri, May 13, 2022 at 9:16 AM liuhongt wrote: > > > > Assembly

RE: [PATCH] docs: Document new param x86-stlf-window-ninsns.

2022-04-06 Thread Liu, Hongtao via Gcc-patches
> -Original Message- > From: Martin Liška > Sent: Wednesday, April 6, 2022 3:35 PM > To: gcc-patches@gcc.gnu.org > Cc: Liu, Hongtao > Subject: [PATCH] docs: Document new param x86-stlf-window-ninsns. > > Hi. > > The patch documents the newly added parameter. One question I have is if

RE: [PATCH v3] AVX512FP16: Fix wrong code for _mm_mask_f[c]madd.*sch [PR 104978]

2022-03-21 Thread Liu, Hongtao via Gcc-patches
> -Original Message- > From: Wang, Hongyu > Sent: Tuesday, March 22, 2022 11:28 AM > To: Liu, Hongtao > Cc: gcc-patches@gcc.gnu.org > Subject: [PATCH v3] AVX512FP16: Fix wrong code for _mm_mask_f[c]madd.*sch > [PR 104978] > > Hi, here is the patch with force_reg before

RE: [PATCH] x86: Update model value for Alderlake and Rocketlake

2022-01-03 Thread Liu, Hongtao via Gcc-patches
> -Original Message- > From: Cui, Lili > Sent: Tuesday, January 4, 2022 1:20 PM > To: gcc-patches@gcc.gnu.org > Cc: ubiz...@gmail.com; Liu, Hongtao ; > hjl.to...@gmail.com > Subject: [PATCH] x86: Update model value for Alderlake and Rocketlake > > Hi Uros, > > This patch is to update

RE: [PATCH] i386: vcvtph2ps and vcvtps2ph should be used to convert _Float16 to SFmode with -mf16c [PR 102811]

2021-11-23 Thread Liu, Hongtao via Gcc-patches
>-Original Message- >From: Kong, Lingling >Sent: Wednesday, November 24, 2021 2:25 PM >To: Liu, Hongtao ; gcc-patches@gcc.gnu.org >Cc: Kong, Lingling >Subject: RE: [PATCH] i386: vcvtph2ps and vcvtps2ph should be used to convert >_Float16 to SFmode with -mf16c [PR 102811] > >Hi, >

RE: [PATCH] AVX512FP16: Support cond_op for HFmode

2021-09-23 Thread Liu, Hongtao via Gcc-patches
>-Original Message- >From: Wang, Hongyu >Sent: Thursday, September 23, 2021 5:16 PM >To: Liu, Hongtao >Cc: gcc-patches@gcc.gnu.org >Subject: [PATCH] AVX512FP16: Support cond_op for HFmode > >Hi, > >This patch extend the expanders for cond_op to support vector HF modes. >bootstraped

RE: [PATCH] Support logic shift left/right for avx512 mask type.

2021-07-21 Thread Liu, Hongtao via Gcc-patches
>-Original Message- >From: Uros Bizjak >Sent: Wednesday, July 21, 2021 4:23 PM >To: Hongtao Liu >Cc: Liu, Hongtao ; gcc-patches@gcc.gnu.org; H. J. Lu >; Richard Biener >Subject: Re: [PATCH] Support logic shift left/right for avx512 mask type. > >On Wed, Jul 21, 2021 at 5:05 AM Hongtao

RE: [PATCH] Canonicalize (vec_duplicate (not A)) to (not (vec_duplicate A)).

2021-06-03 Thread Liu, Hongtao via Gcc-patches
>-Original Message- >From: Segher Boessenkool >Sent: Friday, June 4, 2021 4:00 AM >To: Liu, Hongtao >Cc: Richard Biener ; GCC Patches patc...@gcc.gnu.org> >Subject: Re: [PATCH] Canonicalize (vec_duplicate (not A)) to (not >(vec_duplicate A)). > >On Thu, Jun 03, 2021 at 11:03:43AM

RE: [PATCH] [i386] Fix ICE of insn does not satisfy its constraints.

2021-06-03 Thread Liu, Hongtao via Gcc-patches
>-Original Message- >From: Jakub Jelinek >Sent: Thursday, June 3, 2021 9:49 PM >To: Liu, Hongtao >Cc: gcc-patches@gcc.gnu.org >Subject: Re: [PATCH] [i386] Fix ICE of insn does not satisfy its constraints. > >On Thu, Jun 03, 2021 at 05:07:26PM +0800, liuhongt via Gcc-patches wrote: >>

RE: [PATCH] Canonicalize (vec_duplicate (not A)) to (not (vec_duplicate A)).

2021-06-03 Thread Liu, Hongtao via Gcc-patches
>-Original Message- >From: Segher Boessenkool >Sent: Thursday, June 3, 2021 4:46 AM >To: Richard Biener >Cc: Liu, Hongtao ; GCC Patches patc...@gcc.gnu.org> >Subject: Re: [PATCH] Canonicalize (vec_duplicate (not A)) to (not >(vec_duplicate A)). > >Hi! > >On Wed, Jun 02, 2021 at

RE: gcc-wwwdocs branch master updated. 88e29096c36837553fc841bd1fa5df6caa776b44

2020-11-05 Thread Liu, Hongtao via Gcc-patches
>-Original Message- >From: Liu, Hongtao >Sent: Friday, November 6, 2020 9:22 AM >To: Gerald Pfeifer ; Hongtao Liu ; >hongtao Liu >Cc: gcc-patches@gcc.gnu.org >Subject: RE: gcc-wwwdocs branch master updated. >88e29096c36837553fc841bd1fa5df6caa776b44 > > > >>-Original Message-

RE: gcc-wwwdocs branch master updated. 88e29096c36837553fc841bd1fa5df6caa776b44

2020-11-05 Thread Liu, Hongtao via Gcc-patches
>-Original Message- >From: Gerald Pfeifer >Sent: Friday, November 6, 2020 5:57 AM >To: Hongtao Liu ; hongtao Liu > >Cc: gcc-patches@gcc.gnu.org >Subject: Re: gcc-wwwdocs branch master updated. >88e29096c36837553fc841bd1fa5df6caa776b44 > >On Thu, 29 Oct 2020, hongtao Liu via