On Fri, Nov 30, 2012 at 6:58 AM, Christophe Gisquet
<[email protected]> wrote:
> From 210 cycles to 87 on Penryn.
> Unrolling and not storing mask both save some cycles.
> ---
>  libavcodec/x86/sbrdsp.asm    |   21 +++++++++++++++++++++
>  libavcodec/x86/sbrdsp_init.c |    2 ++
>  2 files changed, 23 insertions(+), 0 deletions(-)
>
> diff --git a/libavcodec/x86/sbrdsp.asm b/libavcodec/x86/sbrdsp.asm
> index aff6879..49dd78c 100644
> --- a/libavcodec/x86/sbrdsp.asm
> +++ b/libavcodec/x86/sbrdsp.asm
> @@ -206,6 +206,26 @@ cglobal sbr_sum64x5, 1,2,4,z
>    jne  .loop
>    REP_RET
>
> +cglobal sbr_neg_odd_64, 1,2,4,z
> +  lea       r1q, [zq+256]
> +.loop:
> +  mova       m0, [zq+ 0]
> +  mova       m1, [zq+16]
> +  mova       m2, [zq+32]
> +  mova       m3, [zq+48]
> +  xorps      m0, [ps_mask2]
> +  xorps      m1, [ps_mask2]
> +  xorps      m2, [ps_mask2]
> +  xorps      m3, [ps_mask2]

Maybe load ps_mask2 into a register once before the loop, instead of reading it from memory in each xorps?
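
To illustrate the idea, here is a rough sketch in C with SSE intrinsics rather than yasm. It is not the patch itself: the function name is hypothetical, the mask value is an assumption (the definition of ps_mask2 is not in the quoted hunk; I am assuming it sets the sign bit of the odd-indexed floats), and the stores and loop step are guessed from the 4x unrolled loads shown above.

```c
#include <xmmintrin.h>  /* SSE: __m128, _mm_set_ps, _mm_load_ps, _mm_xor_ps, _mm_store_ps */

/* Hypothetical C stand-in for sbr_neg_odd_64: flips the sign of
 * z[1], z[3], ..., z[63]. z must be 16-byte aligned, like the
 * mova loads in the patch require. */
static void neg_odd_64_hoisted(float *z)
{
    /* Assumed equivalent of ps_mask2: -0.0f (only the sign bit set)
     * in lanes 1 and 3. _mm_set_ps takes the highest lane first.
     * Built once, outside the loop, so it can stay in a register. */
    const __m128 mask = _mm_set_ps(-0.0f, 0.0f, -0.0f, 0.0f);

    /* 4x unrolled as in the patch: 16 floats (64 bytes) per iteration. */
    for (int i = 0; i < 64; i += 16) {
        __m128 a = _mm_load_ps(z + i +  0);
        __m128 b = _mm_load_ps(z + i +  4);
        __m128 c = _mm_load_ps(z + i +  8);
        __m128 d = _mm_load_ps(z + i + 12);

        /* XOR with a register operand; no memory read for the mask here. */
        _mm_store_ps(z + i +  0, _mm_xor_ps(a, mask));
        _mm_store_ps(z + i +  4, _mm_xor_ps(b, mask));
        _mm_store_ps(z + i +  8, _mm_xor_ps(c, mask));
        _mm_store_ps(z + i + 12, _mm_xor_ps(d, mask));
    }
}
```

In the asm this would presumably be a mova of ps_mask2 into m4 before .loop, with the XMM register count in cglobal bumped from 4 to 5. Whether it is actually faster would need to be measured again.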

Jason