Has anyone tried using cudaprof to profile their pycuda code? I can run cudaprof and get top-level summary stats about total method and memcopy runtimes, but I can't seem to get the counters to work. I check all the counter checkboxes but nothing gets reported.
I am running on Windows Vista 32-bit with a 9800GTS card. Thanks in advance for any help with this, Tom
_______________________________________________ PyCuda mailing list [email protected] http://tiker.net/mailman/listinfo/pycuda_tiker.net
