align(16) union __m128i { ubyte[16] data; }
void store(__m128i* src, __m128i* dst) { asm { movdqu [dst], src; } }

The compiler complains about "bad type/size of operands 'movdqu'", but both data segments are 16-byte aligned, so shouldn't they be in an XMM# register? Is there something I'm missing here?