Re: [FFmpeg-devel] [PATCH 1/2] lavc/aacpsdsp: rework RISC-V add_squares

2023-07-16 Thread Rémi Denis-Courmont
Le lauantaina 15. heinäkuuta 2023, 23.57.04 EEST Rémi Denis-Courmont a écrit : > Segmented loads are kinda slow, so this advantageously uses a > unit-strided load and narrowing shifts instead. > > Before: > ps_add_squares_rvv_f32: 30352.2 > > After: > ps_add_squares_rvv_f32: 11973.7 > --- >

[FFmpeg-devel] [PATCH 1/2] lavc/aacpsdsp: rework RISC-V add_squares

2023-07-15 Thread Rémi Denis-Courmont
Segmented loads are kinda slow, so this advantageously uses a unit-strided load and narrowing shifts instead. Before: ps_add_squares_rvv_f32: 30352.2 After: ps_add_squares_rvv_f32: 11973.7 --- libavcodec/riscv/aacpsdsp_rvv.S | 11 +++ 1 file changed, 7 insertions(+), 4 deletions(-)