Cool! A ready OO-wrapper. Thanks. I agree, the best-performance GPGPU code needs fine tuning (although any kind of computation on giant arrays of floats is much faster on a GPU, then on a multi-threaded CPU). That's why i want my data structures and computation to be fine-tunable in respect of hardware. And, optionally, I'd want it to smoothly integrate with my main D computation, from which the kernels are generated and processed.
On Wed, Sep 28, 2011 at 5:30 PM, Trass3r <[email protected]> wrote: >> Read news, specs, presentations about it. >> Have a clear overall understanding of it. >> Downloaded the API recently. >> Went through examples. > > Well GPGPU only comes with big performance gains if you hand-tune the > (kernel and API interaction) code. > Normally you even have to optimize it for specific GPUs to optimally use > local memory etc. > or you will end up being not much faster than a proper multi-threaded CPU > implementation in most cases. > >> Gonna try making hello world examples. > > If you want to use it with D: > https://bitbucket.org/trass3r/cl4d >
