Stanislav, I'm getting ~7x speed-up on my GTX280:
*Tgpu= 0.0185270309448 Tcpu= 0.125254869461 0.612439375 0.612439375 0 *You may want to consider the "mem_alloc_host" (more at http://www.ddj.com/cpp/217500110). Best,* * On Wed, Jun 24, 2009 at 5:03 AM, Stanislav Ravas <[email protected]> wrote: > Good day, > > I'm new to both CUDA and PyCUDA. I'm trying to write binary > erosion/dilation accelerator module for my project, but they are slower > then scipy.ndimage's functions. > > I don't know if i'm doing something wrong(as I said, I'm new), or nvidia > nvs140m in my notebook is just not fast enough. > > It would be great if someone with more powerful card could try it, or > may be some guru :) could have a look into my sources? > > Source is attached. > > If I get it to work, I'll share it for all :) > > Anyway, CUDA and PyCUDA are great work! > > Thanks > > > _______________________________________________ > PyCUDA mailing list > [email protected] > http://tiker.net/mailman/listinfo/pycuda_tiker.net > > -- Nicolas Pinto Ph.D. Candidate, Brain & Computer Sciences Massachusetts Institute of Technology, USA http://web.mit.edu/pinto
_______________________________________________ PyCUDA mailing list [email protected] http://tiker.net/mailman/listinfo/pycuda_tiker.net
