On 05/15/2011 02:40 PM, Justin Ruggles wrote:

> +%macro PHADDD4 2 ; xmm src, xmm tmp
> +    movhlps  %2, %1
> +    paddd    %1, %2
> +    pshufd   %2, %1, 0x1
> +    paddd    %1, %2
> +%endmacro


I forgot to mention that this was actually faster on sandy bridge than
doing:
phaddd  %1, %1
phaddd  %1, %1

-Justin
_______________________________________________
libav-devel mailing list
[email protected]
https://lists.libav.org/mailman/listinfo/libav-devel

Reply via email to