Re: SIMD support...

Walter Bright Fri, 06 Jan 2012 00:46:13 -0800

On 1/5/2012 5:42 PM, Manu wrote:

So I've been hassling about this for a while now, and Walter asked me to pitch
an email detailing a minimal implementation with some initial thoughts.


Takeaways:

1. SIMD behavior is going to be very machine specific.

2. Even trying to do something with + is fraught with peril, as integer addswith SIMD can be saturated or unsaturated.

3. Trying to build all the details about how each of the various adds and otherops work into the compiler/optimizer is a large undertaking. D would have tosupport internally maybe a 100 or more new operators.

So some simplification is in order, perhaps a low level layer that is fairlyextensible for new instructions, and for which a library can be layered over fora more presentable interface. A half-formed idea of mine is, taking a cue fromyours:


Declare one new basic type:

    __v128

which represents the 16 byte aligned 128 bit vector type. The only operationsdefined to work on it would be construction and assignment. The __ prefixsignals that it is non-portable.


Then, have:

   import core.simd;

which provides two functions:

   __v128 simdop(operator, __v128 op1);
   __v128 simdop(operator, __v128 op1, __v128 op2);

This will be a function built in to the compiler, at least for the x86. (Otherarchitectures can provide an implementation of it that simulates its operation,but I doubt that it would be worth anyone's while to use that.)


The operators would be an enum listing of the SIMD opcodes,

    PFACC, PFADD, PFCMPEQ, etc.

For:

    z = simdop(PFADD, x, y);

the compiler would generate:

    MOV z,x
    PFADD z,y

The code generator knows enough about these instructions to do registerassignments reasonably optimally.

What do you think? It ain't beeyoootiful, but it's implementable in a reasonableamount of time, and it should make writing tight & fast SIMD code without havingto do it all in assembler.

One caveat is it is typeless; a __v128 could be used as 4 packed ints or 2packed doubles. One problem with making it typed is it'll add 10 more types tothe base compiler, instead of one. Maybe we should just bite the bullet and dothe types:


    __vdouble2
    __vfloat4
    __vlong2
    __vulong2
    __vint4
    __vuint4
    __vshort8
    __vushort8
    __vbyte16
    __vubyte16

Re: SIMD support...

Reply via email to