Re: [PATCH v2] LoongArch: Implement FCCmode reload and cstore4

2023-12-21 Thread Jiahao Xu
SPECCPU 2017 and SPECCPU 2006 successfully built and tested, and this patch gives a 1.3% improvement in SPECCPU 2017 fprate on 3A6000, no performance regression was found. This is an effective optimization and looks good. 在 2023/12/15 下午4:57, Xi Ruoyao 写道: We used a branch to load

[PATCH v2] LoongArch: Implement FCCmode reload and cstore4

2023-12-15 Thread Xi Ruoyao
We used a branch to load floating-point comparison results into GPR. This is very slow when the branch is not predictable. Implement movfcc so we can reload FCCmode into GPRs, FPRs, and MEM. Then implement cstore4. gcc/ChangeLog: * config/loongarch/loongarch-tune.h