pare might not be hot enough.
>
> Tested it with my micro benchmark. 5-10% performance gain when compare
> length is 7.
Nice!
>
>>
>>>
>>> Bootstrapped and tested on powerpc64-linux BE and LE with no
>>> regressions. Is it OK for the trunk?
Hi Kewen,
On 2024/5/9 13:44, Kewen.Lin wrote:
> Why does it need power8 forced here?
I thought it over. It's not needed. On the sub-targets where the library
is called, l[hb]z won't be generated either.
Thanks
Gui Haochen
7.
>
>>
>> Bootstrapped and tested on powerpc64-linux BE and LE with no
>> regressions. Is it OK for the trunk?
>>
>> Thanks
>> Gui Haochen
>>
>> ChangeLog
>> rs6000: Enable overlapped by-pieces operations
>>
>> This patch enab
me benchmark?
>
> Bootstrapped and tested on powerpc64-linux BE and LE with no
> regressions. Is it OK for the trunk?
>
> Thanks
> Gui Haochen
>
> ChangeLog
> rs6000: Enable overlapped by-pieces operations
>
> This patch enables overlapped by-piece operations by def
: Enable overlapped by-pieces operations
This patch enables overlapped by-pieces operations by defining
TARGET_OVERLAP_OP_BY_PIECES_P to true. On rs6000, the default
move/set/clear ratio is 2, so the overlap is only enabled for compare
by-pieces.
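To illustrate what the overlap means for a 7-byte compare, here is a
minimal C sketch. It is not the code GCC emits; the function names
equal7_pieces and equal7_overlapped are hypothetical, and the sketch
assumes an equality-only compare with 4-byte pieces. Without overlap the
7 bytes are split into 4 + 2 + 1 byte pieces. With overlap, two 4-byte
pieces at offsets 0 and 3 cover the same range, and byte 3 is read twice.

```c
#include <stdint.h>
#include <string.h>

/* Without overlap: 7 bytes are compared as 4 + 2 + 1 byte pieces
   (three loads per operand).  */
static int equal7_pieces(const void *a, const void *b)
{
    const unsigned char *pa = a, *pb = b;
    uint32_t a4, b4;
    uint16_t a2, b2;

    memcpy(&a4, pa, 4);
    memcpy(&b4, pb, 4);
    memcpy(&a2, pa + 4, 2);
    memcpy(&b2, pb + 4, 2);
    return a4 == b4 && a2 == b2 && pa[6] == pb[6];
}

/* With overlap: two 4-byte pieces at offsets 0 and 3 (two loads per
   operand).  Byte 3 belongs to both pieces, which is harmless for an
   equality test.  */
static int equal7_overlapped(const void *a, const void *b)
{
    const unsigned char *pa = a, *pb = b;
    uint32_t a_lo, b_lo, a_hi, b_hi;

    memcpy(&a_lo, pa, 4);
    memcpy(&b_lo, pb, 4);
    memcpy(&a_hi, pa + 3, 4);
    memcpy(&b_hi, pb + 3, 4);
    /* Any differing byte leaves a nonzero bit in one of the XORs.  */
    return ((a_lo ^ b_lo) | (a_hi ^ b_hi)) == 0;
}
```

Both functions return nonzero when the first 7 bytes match. The
overlapped form only works for equality, since the doubled byte would
confuse an ordered (less/greater) result.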
gcc/
* config/rs6000/rs6000.cc