Re: D and GPGPU

luminousone via Digitalmars-d Wed, 18 Feb 2015 10:18:13 -0800

On Wednesday, 18 February 2015 at 15:15:21 UTC, Russel Winder wrote:

It strikes me that D really ought to be able to work with GPGPU – is there already something and I just failed to notice? This is data parallelism, but of a slightly different sort to that in std.parallelism. std.concurrent, std.parallelism, std.gpgpu ought to be harmonious though.
The issue is to create a GPGPU kernel (usually C code with bizarre data structures and calling conventions), set it running, and then pipe data in and collect data out – currently very slow, but the next generation of Intel chips will fix this (*). And then there is the OpenCL/CUDA debate.
Personally I think OpenCL, for all its deficiencies, as it is vendor neutral. CUDA binds you to NVIDIA. Anyway there is an NVIDIA back end for OpenCL. With a system like PyOpenCL, the infrastructure data and process handling is abstracted, but you still have to write the kernels in C. They really ought to do a Python DSL for that, but… So with D can we write D kernels and have them compiled and loaded using a combination of CTFE, D → C translation, C compiler call, and other magic?
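As a rough illustration of the "translate to C, then hand it to the compiler" step described above, here is a minimal sketch that generates OpenCL C kernel source from a short expression. It is only a sketch under stated assumptions: the function name make_elementwise_kernel, the template, and the fixed three-buffer signature are all hypothetical, and nothing here is taken from PyOpenCL or from any existing D library.

```python
from string import Template

# Hypothetical template for an elementwise OpenCL C kernel over two inputs.
# The three-buffer signature (a, b, out) is an assumption made for this sketch.
_KERNEL_TEMPLATE = Template("""\
__kernel void $name(__global const $dtype *a,
                    __global const $dtype *b,
                    __global $dtype *out)
{
    size_t i = get_global_id(0);
    out[i] = $expr;
}
""")


def make_elementwise_kernel(name, expr, dtype="float"):
    """Return OpenCL C source for a kernel computing out[i] = expr.

    The expression is pasted in verbatim, so it must already be valid
    OpenCL C written in terms of a[i] and b[i].
    """
    if not name.isidentifier():
        raise ValueError("kernel name must be a valid identifier")
    return _KERNEL_TEMPLATE.substitute(name=name, dtype=dtype, expr=expr)


# Generate the source for a vector addition kernel and show it.
source = make_elementwise_kernel("vec_add", "a[i] + b[i]")
print(source)
```

The resulting string is what would then be passed to an OpenCL runtime for compilation and loading (with PyOpenCL, for example, via pyopencl.Program(ctx, source).build()). In D, the same generation step could plausibly run at compile time using CTFE and string mixins, which is the kind of "other magic" the question is asking about.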

Is this a GSoC 2015 type thing?
(*) It will be interesting to see how NVIDIA responds to the tack Intel are taking on GPGPU and main memory access.


https://github.com/HSAFoundation

This is really the way to go. Yes, OpenCL and CUDA exist, along with OpenGL/DirectX compute shaders, but pretty much everything out there suffers from giant limitations.

With HSA, HSAIL bytecode is embedded directly into the ELF/EXE file. HSAIL bytecode can fully support all the features of C++: virtual function lookups in code, access to the stack, cache-coherent memory access, the same virtual memory view as the application it runs in, etc.

HSA is implemented in the LLVM backend compiler, and when it is used in an ELF/EXE file, there is an LLVM-based finalizer that generates GPU bytecode.

More importantly, it should be very easy to implement in any LLVM-supported language once all of the patches are moved upstream to their respective libraries/toolsets.

I believe that Linux kernel 3.19 and above have the IOMMU 2.5 patches, and I think AMD's Radeon KFD driver made it into 3.20. HSA will also be supported by ARM.

HSA is generic enough that, assuming Intel implements similar capabilities in their chips, it ought to be supportable there with or without Intel's direct blessing.

HSA does work with discrete GPUs and not just the embedded stuff, and I believe that HSA can be used to accelerate OpenCL 2.0 via copyless cache-coherent memory access.
