Am 01.06.2013 13:37, schrieb Tove:
On Saturday, 1 June 2013 at 10:57:03 UTC, Benjamin Thaut wrote:
Am 01.06.2013 12:52, schrieb Tove:
does this generate better code?
float4 v = __vector([1.0f, 2.0f, 3.0f, 4.0f]);
That doesn't even compile. You can try it out yourself using:
http://dpaste.dzfl.pl/
Kind Regards
Benjamin Thaut
OK, sorry about that... this compiles, but the 'Disassembly' button is
not functional for me...
http://dpaste.dzfl.pl/1e0407c3
The preoduces assembly is exactly as bad:
mov qword ptr [rbp+10h],rcx
movaps xmm0,xmmword ptr [13F154020h]
movaps xmmword ptr [rbp-60h],xmm0
movaps xmm0,xmmword ptr [rbp-60h]
movaps xmmword ptr [rbp-50h],xmm0
movdqa xmm0,xmmword ptr [rbp-50h]
addps xmm0,xmmword ptr [rbp-50h]
movaps xmmword ptr [rbp-50h],xmm0
movdqa xmm0,xmmword ptr [rbp-50h]
movaps xmmword ptr [rbp-60h],xmm0