On Fri, 30 Nov 2012, Christophe Gisquet wrote:

> +cglobal sbr_qmf_post_shuffle, 2,3,3,W,z
> +  lea       r2q, [zq + (64-4)*4]
> +.loop:
> +  mova       m0, [r2q]
> +  mova       m1, [zq ]
> +  xorps      m0, [ps_neg]
> +  shufps     m0, m0, 0x1B
> +  mova       m2, m0
> +  unpcklps   m0, m1
> +  unpckhps   m2, m1
> +  mova  [Wq +  0], m0
> +  mova  [Wq + 16], m2
> +  add        Wq, 32
> +  sub       r2q, 16
> +  add        zq, 16
> +  cmp        zq, r2q
> +  jl      .loop
> +  REP_RET

If you increment an index into W and z rather than the pointers
themselves, then you can eliminate an add and a cmp.
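
A rough, untested sketch of one way to read that, under a few assumptions
that are not in the patch: a fourth GPR (r3) is available and serves as a
negative byte index, the register count in cglobal is raised to 4 to match,
and W and z are biased up front so the index ends at zero. The descending
read still needs its own pointer because the index cannot be subtracted in
an address. The add on the index sets the flags, so the loop branch needs
no cmp, and the separate add on W goes away:

```asm
cglobal sbr_qmf_post_shuffle, 2,4,3,W,z
    lea        r2q, [zq + (64-4)*4]   ; descending read pointer, as in the patch
    add         zq, 32*4              ; bias z past the ascending half (32 floats)
    add         Wq, 64*4              ; bias W past the output (64 floats)
    mov        r3q, -32*4             ; byte index, runs from -128 up to 0
.loop:
    mova        m0, [r2q]
    mova        m1, [zq + r3q]
    xorps       m0, [ps_neg]
    shufps      m0, m0, 0x1B
    mova        m2, m0
    unpcklps    m0, m1
    unpckhps    m2, m1
    mova  [Wq + 2*r3q +  0], m0       ; W advances at twice the rate of z
    mova  [Wq + 2*r3q + 16], m2
    sub        r2q, 16
    add        r3q, 16                ; sets flags; reaches zero after 8 iterations
    jl .loop
    REP_RET
```

Per iteration this is one sub, one add and the branch, against two adds,
one sub, a cmp and the branch in the patch, at the cost of one more register
and three setup instructions outside the loop.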

Indent with 4 spaces; the patch uses 2.

--Loren Merritt
_______________________________________________
libav-devel mailing list
[email protected]
https://lists.libav.org/mailman/listinfo/libav-devel
