On Thursday, 18 May 2017 at 09:07:38 UTC, Nicholas Wilson wrote:
> When ldc runs you will get a kernels_cudaxxx_yy.ptx (where xxx is the CUDA compute capability specified on the command line and yy is 32 or 64 for 32- or 64-bit) which should fit somewhere into your existing C++ pipeline.

Whoops, that assumes you have a CUDA driver API pipeline in your C++ code, which, given that you're asking, I'm not sure you have. If you're using the `kernel<<<...>>>(args)` form to launch your kernels then you are going to have a lot more work to do in D, because you'll need to use the driver API (http://docs.nvidia.com/cuda/cuda-driver-api/#axzz4hQLA0Zdm).
You'll need to (see the sketch after this list):
*get a device
*create a context from it
*get a stream on that context
*load the ptx module (possibly linking it with other modules, to resolve missing symbols).
*compile it for the device
*then launch a kernel from that module on that device by name, passing the arguments in a void*[].
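To make that concrete, here's a minimal sketch of those steps in C against the driver API. The PTX file name, kernel name and argument types are placeholders for whatever ldc emits for your code, and real code should report cuGetErrorString output rather than the raw error number.

```c
#include <cuda.h>
#include <stdio.h>

/* Tiny error-check helper; real code should print cuGetErrorString output. */
#define CHECK(call) do { CUresult r_ = (call); if (r_ != CUDA_SUCCESS) { \
    fprintf(stderr, "driver API error %d at line %d\n", (int)r_, __LINE__); return 1; } } while (0)

int main(void)
{
    CHECK(cuInit(0));                          /* initialise the driver API    */

    CUdevice dev;
    CHECK(cuDeviceGet(&dev, 0));               /* get a device                 */

    CUcontext ctx;
    CHECK(cuCtxCreate(&ctx, 0, dev));          /* create a context from it     */

    CUstream stream;
    CHECK(cuStreamCreate(&stream, 0));         /* get a stream on that context */

    /* Load the PTX ldc emitted; the driver JIT-compiles it for this device.
       File and kernel names are placeholders.                                 */
    CUmodule mod;
    CHECK(cuModuleLoad(&mod, "kernels_cuda350_64.ptx"));

    CUfunction kern;
    CHECK(cuModuleGetFunction(&kern, mod, "myKernel"));

    /* Arguments go in as an array of pointers to the actual values (void*[]). */
    size_t n = 1024;
    CUdeviceptr dbuf;
    CHECK(cuMemAlloc(&dbuf, n * sizeof(float)));
    int count = (int)n;
    void *args[] = { &dbuf, &count };

    CHECK(cuLaunchKernel(kern,
                         (unsigned)((n + 255) / 256), 1, 1,  /* grid  */
                         256, 1, 1,                          /* block */
                         0, stream, args, NULL));
    CHECK(cuStreamSynchronize(stream));

    cuMemFree(dbuf);
    cuModuleUnload(mod);
    cuCtxDestroy(ctx);
    return 0;
}
```

You link against the driver library (-lcuda) rather than the runtime; depending on your setup you may also need to point the compiler at the CUDA include and lib directories.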

The sad thing is that it's still nicer than OpenCL, because in OpenCL you have to pass the runtime args (with their sizes) one by one to a function.
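For comparison, the OpenCL side looks roughly like this (a fragment only; the queue, kernel and buffer are assumed to already exist, and the kernel is assumed to take a buffer and an int):

```c
#include <CL/cl.h>

/* Every argument is set individually, with its size, before enqueueing.
   All names here are placeholders. */
cl_int launch(cl_command_queue queue, cl_kernel kern, cl_mem dbuf, cl_int count)
{
    cl_int err = clSetKernelArg(kern, 0, sizeof(cl_mem), &dbuf);
    if (err != CL_SUCCESS) return err;
    err = clSetKernelArg(kern, 1, sizeof(cl_int), &count);
    if (err != CL_SUCCESS) return err;

    size_t global = 1024, local = 256;
    return clEnqueueNDRangeKernel(queue, kern, 1, NULL, &global, &local,
                                  0, NULL, NULL);
}
```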

Hence I want to automate as much of that shit as possible.
I hope to have that done ASAP, but I don't have hardware set up to test CUDA at the moment (I have one on my Windows box, but I don't have a dev environment set up there), and I'll be working on OpenCL at the same time (and there's only so much horrible API I can take in a day). I'll be working on dcompute part-part-time next semester though, so I should be able to get a fair bit done, and quite a few others are interested, so that'll speed things up a bit.
