On 02/22/2010 08:59 AM, Gaetano Mendola wrote:
>>> ippsStdDev_32f(myRealVector, mySize,

> vex amd64->IR: unhandled instruction bytes: 0x48 0xF 0x5A 0x6 0x48 0xF

Disassembling the first 4 bytes gives:   rex.W cvtps2pd (%rsi),%xmm0
The problem is: "What is the meaning of the rex.W prefix on cvtps2pd?"

The Intel manual  http://www.intel.com/Assets/PDF/manual/253666.pdf
    (IntelĀ® 64 and IA-32 Architectures Software Developer's Manual
    Volume 2A: Instruction Set Reference, A-M, December 2009)
lists the four instructions:
    66 0F 5A  cvtpd2ps
       0F 5A  cvtps2pd
    F2 0F 5A  cvtsd2ss
    F3 0F 5A  cvtss2sd
but there is no explicit listing for "48 0F 5A".
The opcode cvtps2pd (0F 5A) is "convert two packed 32-bit floating point
values in %xmm2/mem (source) to two packed 64-bit floating point values
in %xmm1 (destination)".  It is unclear what applying a rex.W prefix means.
Does anybody know?

-- 

------------------------------------------------------------------------------
Download Intel® Parallel Studio Eval
Try the new software tools for yourself. Speed compiling, find bugs
proactively, and fine-tune applications for parallel performance.
See why Intel Parallel Studio got high marks during beta.
http://p.sf.net/sfu/intel-sw-dev
_______________________________________________
Valgrind-users mailing list
Valgrind-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/valgrind-users

Reply via email to