On Fri, 30 Nov 2012, Christophe Gisquet wrote: > +cglobal sbr_qmf_post_shuffle, 2,3,3,W,z > + lea r2q, [zq + (64-4)*4] > +.loop: > + mova m0, [r2q] > + mova m1, [zq ] > + xorps m0, [ps_neg] > + shufps m0, m0, 0x1B > + mova m2, m0 > + unpcklps m0, m1 > + unpckhps m2, m1 > + mova [Wq + 0], m0 > + mova [Wq + 16], m2 > + add Wq, 32 > + sub r2q, 16 > + add zq, 16 > + cmp zq, r2q > + jl .loop > + REP_RET
If you increment an index into W and z rather than the pointers themselves, then you can eliminate an add and a cmp. 4 space tabs. --Loren Merritt _______________________________________________ libav-devel mailing list [email protected] https://lists.libav.org/mailman/listinfo/libav-devel
