On Fri, 30 Nov 2012, Christophe Gisquet wrote:

> +    mova        m0, [X_lowq + start]
> +    movlhps     m1, m1             ; (a2 a3 a2 a3)
> +    movlhps     m2, m2             ; (a0 a1 a0 a1)
> +    shufps      m3, m3, 00010001b  ; (a3 a2 a3 a2)
> +    shufps      m4, m4, 00010001b  ; (a1 a0 a1 a0)
> +    xorps       m3, m7             ; (-a3 a2 -a3 a2)
> +    xorps       m4, m7             ; (-a1 a0 -a1 a0)
> +.loop2:
> +    mova        m5, m0
> +    mova        m6, m0
> +    shufps      m0, m0, 10100000b ; {Xl[-2][0],",Xl[-1][0],"}
> +    shufps      m5, m5, 11110101b ; {Xl[-2][1],",Xl[-1][1],"}
> +    mulps       m0, m2
> +    mulps       m5, m4
> +    mova        m7, m6
> +    addps       m5, m0
> +    mova        m0, [X_lowq + start + 2*2*4]
> +    shufps      m6, m0, 00001010b ; {Xl[-1][0],",Xl[0][0],"}
> +    shufps      m7, m0, 01011111b ; {Xl[-1][1],",Xl[1][1],"}

Recommend using base-4 for shuffle constants.

--Loren Merritt
_______________________________________________
libav-devel mailing list
[email protected]
https://lists.libav.org/mailman/listinfo/libav-devel

Reply via email to