Re: core.simd woes

jerro Tue, 07 Aug 2012 07:00:28 -0700

I can see your reasoning, but I think that should be incore.sse, orcore.simd.sse personally. Or you'll end up with VMX, NEON, etcall blobbed
in one huge intrinsic wrapper file.


I would be okay with core.simd.sse or core.sse.

That said, almost all simd opcodes are directly accessible instd.simd.There are relatively few obscure operations that don't have arepresenting
function.
The unpck/shuf example above for instance, they botheffectively perform a
sort of swizzle, and both are accessible through swizzle!().

They aren't. Swizzle only takes one argument, so you cant use itto select elements from two vectors. Both unpcklps and shufpstake two arguments. Writing a swizzle with two arguments would bemuch harder.

The swizzle
mask is analysed by the template, and it produces the bestopcode to matchthe pattern. Take a look at swizzle, it's bloody complicated todo that the
most efficient way on x86.

Now imagine how complicated it would be to write a swizzle withto vector arguments.

The reason I didn't write the DMD support yet is because it wasincomplete,and many opcodes weren't yet accessible, like shuf forinstance... and Ijust wasn't finished. Stopped to wait for DMD to be featurecomplete.I'm not opposed to this idea, although I do have a concernthat, becausethere's no __forceinline in D (or macros), adding another layerofabstraction will make maths code REALLY slow in unoptimisedbuilds.Can you suggest a method where these would be treated as Cmacros, and not
produce additional layers of function calls?

Unfortunately I can't, at least not a clean one. Using stringmixins would be one way but I think no one wants that kind of APIin Druntime or Phobos.

I'm already unhappy that
std.simd produces redundant function calls.


<rant> please  please please can haz __forceinline! </rant>


I agree that we need that.

Re: core.simd woes

Reply via email to