[PyCUDA] pycuda.driver.Context.synchronize() delay time is a function of the count and kind of sram accesses?

2010-01-02 Thread Hampton G. Miller
I have noticed something which seems odd and which I hope you will look at and then tell me if it is something unique to PyCUDA or else is something which should be brought to the attention of Nvidia. (Or, that I am just a simpleton!) Looking at my test results, below, and referring to my

Re: [PyCUDA] pycuda.driver.Context.synchronize() delay time is a function of the count and kind of sram accesses?

2010-01-02 Thread Andreas Klöckner
Couple points: * bytewise smem write may be slow? * sync before and after timed operation, otherwise you time who knows what * or, even better, use events. HTH, Andreas On Samstag 02 Januar 2010, Hampton G. Miller wrote: I have noticed something which seems odd and which I hope you will look