Hi, I'm invoking a kernel using pycuda.driver.Out and the
time_kernel=True argument to return the time in seconds. Does this
time include the device-to-host copy time?
Thanks! I'm happy to report a preliminary 165x speedup over our
existing radar imaging implementation thanks to
On Wednesday 18 March 2009, William King wrote:
I'm glad to help. I'd like to put together a test that runs the same
algorithm once on the GPU and once on the CPU and compares the
speeds.
Do you think it would be possible to define the algorithm once, and
choose where it is executed?
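
For illustration only, here is a minimal sketch of the "choose where it is
executed" idea combined with the speed comparison. It uses a plain dispatch
table and does not define the algorithm only once: each backend still has its
own implementation. All names here (saxpy_cpu, saxpy_gpu_placeholder,
BACKENDS, run, compare) are hypothetical, and the "gpu" entry is a stand-in
that runs on the CPU; a real version would launch a PyCUDA kernel there.

```python
import time


def saxpy_cpu(a, x, y):
    """Reference CPU implementation: a * x + y, element by element."""
    return [a * xi + yi for xi, yi in zip(x, y)]


def saxpy_gpu_placeholder(a, x, y):
    """Hypothetical stand-in for a GPU path.

    Assumption: a real version would copy x and y to the device and
    launch a kernel. Here it simply reuses the CPU code so the sketch
    runs on any machine.
    """
    return saxpy_cpu(a, x, y)


# Dispatch table: maps a backend name to the function that implements it.
BACKENDS = {
    "cpu": saxpy_cpu,
    "gpu": saxpy_gpu_placeholder,
}


def run(backend, a, x, y):
    """Run the computation on the named backend.

    Returns (result, elapsed wall-clock seconds).
    """
    func = BACKENDS[backend]
    start = time.perf_counter()
    result = func(a, x, y)
    elapsed = time.perf_counter() - start
    return result, elapsed


def compare(a, x, y):
    """Run every backend on the same input and return {name: seconds}.

    Raises ValueError if the backends disagree on the result.
    """
    timings = {}
    reference = None
    for name in BACKENDS:
        result, elapsed = run(name, a, x, y)
        if reference is None:
            reference = result
        elif result != reference:
            raise ValueError("backend %r disagrees with the reference" % name)
        timings[name] = elapsed
    return timings
```

Calling compare(2.0, x, y) returns one wall-clock timing per backend, so the
ratio of the two entries gives the speedup. Wall-clock timing around the whole
call would include any host/device transfers a real GPU backend performs.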
On Wednesday 18 March 2009, William King wrote:
OK, as someone new to Python (but very familiar with Perl, Java, PHP)
and new to CUDA, I have been able to set up my test environment and get
test_driver.py and all to run properly.
I would like to help document (while I learn) how to use PyCUDA