On Fri, 30 Nov 2012, Christophe Gisquet wrote:
> + mova m0, [X_lowq + start]
> + movlhps m1, m1 ; (a2 a3 a2 a3)
> + movlhps m2, m2 ; (a0 a1 a0 a1)
> + shufps m3, m3, 00010001b ; (a3 a2 a3 a2)
> + shufps m4, m4, 00010001b ; (a1 a0 a1 a0)
> + xorps m3, m7 ; (-a3 a2 -a3 a2)
> + xorps m4, m7 ; (-a1 a0 -a1 a0)
> +.loop2:
> + mova m5, m0
> + mova m6, m0
> + shufps m0, m0, 10100000b ; {Xl[-2][0],",Xl[-1][0],"}
> + shufps m5, m5, 11110101b ; {Xl[-2][1],",Xl[-1][1],"}
> + mulps m0, m2
> + mulps m5, m4
> + mova m7, m6
> + addps m5, m0
> + mova m0, [X_lowq + start + 2*2*4]
> + shufps m6, m0, 00001010b ; {Xl[-1][0],",Xl[0][0],"}
> + shufps m7, m0, 01011111b ; {Xl[-1][1],",Xl[1][1],"}
Recommend using base-4 for shuffle constants.
--Loren Merritt
_______________________________________________
libav-devel mailing list
[email protected]
https://lists.libav.org/mailman/listinfo/libav-devel