On Fri, Nov 30, 2012 at 6:58 AM, Christophe Gisquet <[email protected]> wrote: > From 210 cycles to 87 on penrynn. > Unrolling and not storing mask both save some cycles. > --- > libavcodec/x86/sbrdsp.asm | 21 +++++++++++++++++++++ > libavcodec/x86/sbrdsp_init.c | 2 ++ > 2 files changed, 23 insertions(+), 0 deletions(-) > > diff --git a/libavcodec/x86/sbrdsp.asm b/libavcodec/x86/sbrdsp.asm > index aff6879..49dd78c 100644 > --- a/libavcodec/x86/sbrdsp.asm > +++ b/libavcodec/x86/sbrdsp.asm > @@ -206,6 +206,26 @@ cglobal sbr_sum64x5, 1,2,4,z > jne .loop > REP_RET > > +cglobal sbr_neg_odd_64, 1,2,4,z > + lea r1q, [zq+256] > +.loop: > + mova m0, [zq+ 0] > + mova m1, [zq+16] > + mova m2, [zq+32] > + mova m3, [zq+48] > + xorps m0, [ps_mask2] > + xorps m1, [ps_mask2] > + xorps m2, [ps_mask2] > + xorps m3, [ps_mask2]
Maybe save this mask value in a register instead of loading it repeatedly? Jason _______________________________________________ libav-devel mailing list [email protected] https://lists.libav.org/mailman/listinfo/libav-devel
