the case, you will be better served looking at some of the examples
in the CUDA SDK instead. I'm sure people on this list could give even better
suggestions if you asked.
Alright thanks a lot.
Best regards,
./francis
___
PyCUDA mailing list
PyCUDA@tiker.net
the O( n ) task in the device thus making it have a constant
running time relatively speaking. Of course that's an ideal case and doesn't
consider the device-host copy delays that much.
Best regards,
./francis
___
PyCUDA mailing list
PyCUDA@tiker.net
Thanks for the suggestion. :) I will try atomic operations and see if they
can fit my need. :)
Best regards,
./francis
If I understand problem correctly, you can try to use one thread
to compute length of each list. Then you can use atomic functions
to find the longest list. This way
mind what the values of the *strings* are, just their count for
each sub-list. I suppose I could just have one thread per sub-list work on
the length of that entire sub-list though I've yet to try that out. :)
Best regards,
./francis
2011/8/24 David Mertens dcmertens.p...@gmail.com
Sorry, meant
with the largest length. Essentially I would
want to have the index of that sub-list so perhaps there's a way to make
each thread do len( ). Then I could just use a host version of max( ) to get
the largest value. Thanks in advance guys and gals.
Best regards,
./francis
in PyCUDA.
Best regards,
./francis
2011/8/4 Lev Givon l...@columbia.edu
Received from Francis on Thu, Aug 04, 2011 at 06:07:40AM EDT:
Hi everyone,
I'm wondering how I can use CUDA C math functions (compiled by adding
-use_fast_math to nvcc) in PyCUDA if I have a source module with a
function
before and after the kernel calls. In our Intel Xeon
setup this is about several hundred microseconds, but I want to be sure if
this is an accurate way of getting the run time. :)
Regards,
./francis
___
PyCUDA mailing list
PyCUDA@tiker.net
http
Thanks! Also found out about those PyCUDA drivers to create events. :)
Regards,
./francis
2011/6/13 Jesse Lu jess...@stanford.edu
Hey,
I used cuda events to measure kernel execution time (and to make sure
kernels were running concurrently). The concurrent kernel example is at
http