Re: [FFmpeg-devel] [PATCH 4/8] avcodec/flac: partially unroll loop in flac_enc_lpc_32

2017-11-26 Thread Rostislav Pehlivanov
On 26 November 2017 at 22:51, James Darnley wrote:
> Now does 6 samples per iteration, up from 2.
>
> From 1.6 to 2.1 times faster again. 2.5 to 3.9 times faster overall.
> Runtime is reduced by a further 4 to 17%. Reduced by 9 to 65% overall.
>
> Same conditions as previously.
> ---
> libavco

[FFmpeg-devel] [PATCH 4/8] avcodec/flac: partially unroll loop in flac_enc_lpc_32

2017-11-26 Thread James Darnley
Now does 6 samples per iteration, up from 2.

From 1.6 to 2.1 times faster again. 2.5 to 3.9 times faster overall.
Runtime is reduced by a further 4 to 17%. Reduced by 9 to 65% overall.

Same conditions as previously.
---
 libavcodec/x86/flac_dsp_gpl.asm | 30 +-
 1 fil
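
For readers unfamiliar with the technique, here is a rough C sketch of what "6 samples per iteration" means for an LPC residual loop. It is not the code from the patch, which is x86 assembly in flac_dsp_gpl.asm. The function names, the argument layout, and the assumption that the routine computes res[i] = smp[i] - (prediction >> shift) are illustrative only.

```c
#include <stddef.h>
#include <stdint.h>

/* Hypothetical reference loop: one residual per outer iteration. */
static void lpc_residual_scalar(int32_t *res, const int32_t *smp, size_t len,
                                int order, const int32_t *coefs, int shift)
{
    /* The first `order` samples have no full history; copy them through. */
    for (size_t i = 0; i < (size_t)order && i < len; i++)
        res[i] = smp[i];

    for (size_t i = (size_t)order; i < len; i++) {
        int64_t sum = 0;
        for (int j = 0; j < order; j++)
            sum += (int64_t)coefs[j] * smp[i - j - 1];
        res[i] = smp[i] - (int32_t)(sum >> shift);
    }
}

/* Hypothetical partially unrolled variant: six residuals per outer
 * iteration. Each coefficient is loaded once and reused for six
 * independent accumulators; a scalar tail handles the leftover samples. */
static void lpc_residual_unrolled6(int32_t *res, const int32_t *smp, size_t len,
                                   int order, const int32_t *coefs, int shift)
{
    for (size_t k = 0; k < (size_t)order && k < len; k++)
        res[k] = smp[k];

    size_t i = (size_t)order;

    /* Main loop: runs while at least six samples remain. */
    for (; i + 6 <= len; i += 6) {
        int64_t s0 = 0, s1 = 0, s2 = 0, s3 = 0, s4 = 0, s5 = 0;
        for (int j = 0; j < order; j++) {
            const int64_t c = coefs[j];
            const int32_t *p = smp + i - j - 1;
            s0 += c * p[0];
            s1 += c * p[1];
            s2 += c * p[2];
            s3 += c * p[3];
            s4 += c * p[4];
            s5 += c * p[5];
        }
        res[i + 0] = smp[i + 0] - (int32_t)(s0 >> shift);
        res[i + 1] = smp[i + 1] - (int32_t)(s1 >> shift);
        res[i + 2] = smp[i + 2] - (int32_t)(s2 >> shift);
        res[i + 3] = smp[i + 3] - (int32_t)(s3 >> shift);
        res[i + 4] = smp[i + 4] - (int32_t)(s4 >> shift);
        res[i + 5] = smp[i + 5] - (int32_t)(s5 >> shift);
    }

    /* Tail: fewer than six samples left, fall back to one at a time. */
    for (; i < len; i++) {
        int64_t sum = 0;
        for (int j = 0; j < order; j++)
            sum += (int64_t)coefs[j] * smp[i - j - 1];
        res[i] = smp[i] - (int32_t)(sum >> shift);
    }
}
```

Both functions should produce identical output for the same input. Any gain in a sketch like this would come from reusing each loaded coefficient across several independent accumulators and from lower loop overhead per sample. The speedups quoted above were measured on the assembly version, not on this C code.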