> Sorry, but the internal assembler of fpc already supports SSE/SSE2/3Dnow
> since years.
> 
Perhaps I don't understand the SSE instructions correctly but AFAIK this should 
work:

movups xmm0, [v1]  // load v1 into reg0
addps xmm0, [v2]   // add reg0 and v2
movups [vr], xmm0  // write reg0 to xmm0

The above code creates an Acess violation.
If I load v2 into a second register and add the two registers everything works 
as expected.

movups xmm0, [v1] // load v1 into reg0
movups xmm1, [v2] // load v2 into reg1
addps xmm0, xmm1  // add reg0 and reg1
movups [vr], xmm0 // write reg0 to vr

v1, v2, vr are packed records of 4 singles

fpc version is 2.0.2

_________________________________________________________________
     To unsubscribe: mail [EMAIL PROTECTED] with
                "unsubscribe" as the Subject
   archives at http://www.lazarus.freepascal.org/mailarchives

Reply via email to