Re: [FFmpeg-devel] [aarch64] improve performance of ff_hscale_8_to_15_neon

Jean-Baptiste Kempf Mon, 25 Nov 2019 14:18:57 -0800

Hello,

On Mon, Nov 25, 2019, at 22:59, Sebastian Pop wrote:
> This patch implements ff_hscale_8_to_15_neon with NEON fused multiply 
> accumulate
> and bumps the vectorization factor from 2 to 4. I have seen speedups up to 15%
> on Graviton A1 instances based on A-72 cpus.


Why adding a new version, in intrinsics, instead of changing the existing 
implementation?

Best,

--
Jean-Baptiste Kempf - President
+33 672 704 734


_______________________________________________
ffmpeg-devel mailing list
[email protected]
https://ffmpeg.org/mailman/listinfo/ffmpeg-devel

To unsubscribe, visit link above, or email
[email protected] with subject "unsubscribe".

Re: [FFmpeg-devel] [aarch64] improve performance of ff_hscale_8_to_15_neon

Reply via email to