Re: SIMD support...

Norbert Nemec Thu, 12 Jan 2012 12:15:21 -0800

On 06.01.2012 02:42, Manu wrote:

I like v128, or something like that. I'll use that for the sake of this
document. I think it is preferable to float4 for a few reasons...

I do not agree at all. That way, the type loses all semantic information. This not only breaks with C/C++/D philosophy but actually *hides* an essential hardware detail of Intel SSE:

An SSE register is 128 bits wide, but the processor actually cares about the semantics of the content:

There are different instructions for loading two doubles, four singles, or integers into a register. They all load the same 128 bits from memory into the same register. Even so, the specs warn about a performance penalty when loading a register as one type and then using it as another. I do not know the internals of the processor, but my understanding is that the CPU splits the floats into mantissa, exponent, and sign already at the moment of loading and has to drop that information when you reinterpret the bit pattern stored in the register.
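To make the point about typed loads concrete, here is a minimal C sketch using SSE2 intrinsics. It assumes an x86 target with SSE2; the function names are made up for illustration. Each intrinsic conventionally corresponds to a different move instruction (noted in the comments), although a compiler is free to pick another encoding. The check at the end only confirms that all three loads deliver the same bits; it says nothing about the performance penalty.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>
#include <emmintrin.h>  /* SSE2 intrinsics; assumes an x86 target with SSE2 */

/* The same 16 bytes, loaded through three differently typed loads.
   The instruction names are the conventional mappings, not a guarantee. */
static inline __m128 load_as_float4(const void *p)    /* four singles: movups */
{
    return _mm_loadu_ps((const float *)p);
}

static inline __m128d load_as_double2(const void *p)  /* two doubles: movupd */
{
    return _mm_loadu_pd((const double *)p);
}

static inline __m128i load_as_int128(const void *p)   /* integers: movdqu */
{
    return _mm_loadu_si128((const __m128i *)p);
}

/* Returns 1 if the three loads leave bit-identical register contents. */
int loads_are_bit_identical(const void *p)
{
    unsigned char a[16], b[16], c[16];

    assert(p != NULL);
    _mm_storeu_ps((float *)a, load_as_float4(p));
    _mm_storeu_pd((double *)b, load_as_double2(p));
    _mm_storeu_si128((__m128i *)c, load_as_int128(p));
    return memcmp(a, b, 16) == 0 && memcmp(b, c, 16) == 0;
}
```

The register contents are the same in all three cases; only the declared type, and with it the instruction the compiler is expected to choose, differs.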

A type v128 would not provide the necessary information for the compiler to produce the correct mov instructions.

There definitely must be a float4 and a double2 type to express these semantics. For integers, I am not quite sure. I believe that integer SSE instructions can be mixed more freely, so a single 128-bit type would be sufficient.
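As a rough sketch of what distinct float4 and double2 types buy you, the C fragment below wraps the SSE register types in two separate structs. It assumes an x86 target with SSE2, and the wrapper and function names are hypothetical, not taken from any existing library. The type tells the compiler which load and store to emit, and a reinterpretation has to be written out explicitly instead of happening silently.

```c
#include <assert.h>
#include <stddef.h>
#include <emmintrin.h>  /* SSE2 intrinsics; assumes an x86 target with SSE2 */

/* Hypothetical wrapper types: the same 128 bits, different declared meaning. */
typedef struct { __m128  v; } float4;
typedef struct { __m128d v; } double2;

float4 float4_load(const float *p)
{
    float4 r;
    assert(p != NULL);
    r.v = _mm_loadu_ps(p);       /* single-precision load */
    return r;
}

void float4_store(float *p, float4 a)
{
    assert(p != NULL);
    _mm_storeu_ps(p, a.v);
}

double2 double2_load(const double *p)
{
    double2 r;
    assert(p != NULL);
    r.v = _mm_loadu_pd(p);       /* double-precision load */
    return r;
}

void double2_store(double *p, double2 a)
{
    assert(p != NULL);
    _mm_storeu_pd(p, a.v);
}

/* Reinterpretation is possible, but only as a visible, explicit step.
   The cast intrinsics change the type and leave the bits untouched. */
double2 float4_as_double2(float4 a)
{
    double2 r;
    r.v = _mm_castps_pd(a.v);
    return r;
}

float4 double2_as_float4(double2 a)
{
    float4 r;
    r.v = _mm_castpd_ps(a.v);
    return r;
}
```

Passing a double2 where a float4 is expected is a compile error here, which is exactly the information a single v128 type would throw away.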

Considering these hardware details of the SSE architecture alone, I fear that portable low-level support for SIMD is very hard to achieve. If you want to offer access to the raw power of each architecture, it might be simpler to have machine-specific language extensions for SIMD and leave the portability to a wrapper library with a common front-end and various back-ends for the different architectures.
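A wrapper library of this kind could look roughly like the following C sketch. Everything in it is an assumption for illustration: the vec4f name, the three-function interface, and the choice of the __SSE__ macro (defined by GCC and Clang when SSE is enabled) to select the back-end. A real library would need more back-ends and a much wider interface.

```c
#include <assert.h>
#include <stddef.h>

#if defined(__SSE__)
/* Back-end 1: machine-specific SSE implementation. */
#include <xmmintrin.h>
typedef struct { __m128 v; } vec4f;

static inline vec4f vec4f_load(const float *p)
{
    vec4f r;
    r.v = _mm_loadu_ps(p);
    return r;
}
static inline void vec4f_store(float *p, vec4f a) { _mm_storeu_ps(p, a.v); }
static inline vec4f vec4f_add(vec4f a, vec4f b)
{
    vec4f r;
    r.v = _mm_add_ps(a.v, b.v);
    return r;
}
#else
/* Back-end 2: plain scalar fallback for any other target. */
typedef struct { float v[4]; } vec4f;

static inline vec4f vec4f_load(const float *p)
{
    vec4f r;
    int i;
    for (i = 0; i < 4; ++i) r.v[i] = p[i];
    return r;
}
static inline void vec4f_store(float *p, vec4f a)
{
    int i;
    for (i = 0; i < 4; ++i) p[i] = a.v[i];
}
static inline vec4f vec4f_add(vec4f a, vec4f b)
{
    vec4f r;
    int i;
    for (i = 0; i < 4; ++i) r.v[i] = a.v[i] + b.v[i];
    return r;
}
#endif

/* Front-end: written once against the common interface above. */
void add_arrays(float *dst, const float *a, const float *b, size_t n)
{
    size_t i = 0;

    assert(n == 0 || (dst != NULL && a != NULL && b != NULL));
    for (; i + 4 <= n; i += 4)   /* four lanes at a time */
        vec4f_store(dst + i, vec4f_add(vec4f_load(a + i), vec4f_load(b + i)));
    for (; i < n; ++i)           /* scalar tail for the leftover elements */
        dst[i] = a[i] + b[i];
}
```

User code such as add_arrays never mentions SSE; only the back-end section knows which instructions exist on the target.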
