> On Feb 17, 2017, at 8:05pm, Karl Rupp <[email protected]> wrote:
>
> Hi,
>
> I suspect that your timings include kernel compilation times. Please have a
> 'warmup' call of norm_1() outside your timing region.
>
> Which GPU do you have on your Macbook Pro?
>
> Best regards,
> Karli
Thanks Karl & Oswin,
The GPU is an Intel HD Graphics 4000.
What worries me is not only the warmup, but the copy phase. In release it can
take a lot of time for a simple 1M items (169 seconds!). I suspect I should try
to do everything on the GPU. Not only that, but this call:
viennacl::vector<float> gpuv;
gpuv.resize(size);
is taking 6 times what it takes on the CPU.
How can I avoid these overheads? I am quite new with OpenCL...
Thank you!
------------------------------------------------------------------------------
Check out the vibrant tech community on one of the world's most
engaging tech sites, SlashDot.org! http://sdm.link/slashdot
_______________________________________________
ViennaCL-devel mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/viennacl-devel