On Saturday, 15 July 2023, 23:57:04 EEST, Rémi Denis-Courmont wrote:
> Segmented loads are kinda slow, so this advantageously uses a
> unit-strided load and narrowing shifts instead.
>
> Before:
> ps_add_squares_rvv_f32: 30352.2
>
> After:
> ps_add_squares_rvv_f32: 11973.7
> ---
>
libavcodec/riscv/aacpsdsp_rvv.S | 11 +++++++----
1 file changed, 7 insertions(+), 4 deletions(-)
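
For readers who do not speak RVV, here is a rough scalar model in C of the idea in the commit message. It is only a sketch and not the code from the patch: add_squares_model is a hypothetical name, and I am assuming the narrowing shifts are of the vnsrl kind, i.e. each interleaved (re, im) float pair is loaded as one 64-bit element and then split with a narrowing shift by 0 (low half) and by 32 (high half), instead of being deinterleaved by a segmented load. The accumulation follows what the C version of ps_add_squares does for floats (dst[i] += re^2 + im^2).

```c
#include <stdint.h>
#include <string.h>

/* Hypothetical scalar model, not FFmpeg code: mimics a unit-strided
 * 64-bit load followed by two narrowing shifts to split each complex
 * sample into its two 32-bit halves. Assumes 32-bit IEEE floats. */
static void add_squares_model(float *dst, const float (*src)[2], int n)
{
    for (int i = 0; i < n; i++) {
        uint64_t pair;
        uint32_t lo, hi;
        float re, im;

        /* Unit-strided load: one 64-bit element per complex sample. */
        memcpy(&pair, src[i], sizeof(pair));

        /* Narrowing shift by 0 keeps the low 32 bits,
         * narrowing shift by 32 keeps the high 32 bits. */
        lo = (uint32_t)pair;
        hi = (uint32_t)(pair >> 32);

        /* Reinterpret the two halves as floats (re is the low half
         * on a little-endian target). */
        memcpy(&re, &lo, sizeof(re));
        memcpy(&im, &hi, sizeof(im));

        dst[i] += re * re + im * im;
    }
}
```

In the vector version the load and both shifts each handle a whole vector of samples at once, so the split costs two cheap ALU operations per vector rather than a segmented memory access.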