On Sun, 25 Dec 2011 10:50:41 -0800, Lewis Anderson <[email protected]> 
wrote:
> Andreas,
> 
> I think that works. I realized after sending this message that
> out-of-order execution is disabled by default, which makes everything
> a little simpler. On an unrelated note, I also discovered that
> allocation is quite slow! This discovery allowed me to achieve another
> 3x improvement.

Allocation speed depends on the implementation, obviously, but it's
really quite bad on Nvidia. PyOpenCL's memory pools can help with that
if you haven't already gone a different route.

http://documen.tician.de/pyopencl/tools.html#memory-pools
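
For intuition about why pooling helps, here is a rough toy model in
plain Python. It is not PyOpenCL's implementation: ToyPool, bin_size and
the power-of-two binning are hypothetical simplifications, and bytearray
merely stands in for a slow device allocation. The idea is that freed
blocks are held back and reused, so the expensive allocator is hit only
when no idle block of a suitable size exists.

```python
from collections import defaultdict


class ToyPool:
    """Toy model of a memory pool: freed blocks are kept and reused."""

    def __init__(self, raw_alloc):
        self.raw_alloc = raw_alloc            # the slow allocator being wrapped
        self.free_blocks = defaultdict(list)  # bin size -> idle blocks
        self.raw_calls = 0                    # times the slow path was taken

    @staticmethod
    def bin_size(nbytes):
        # Round up to the next power of two so similar sizes share a bin
        # (an assumption made here for simplicity).
        return 1 << max(nbytes - 1, 0).bit_length()

    def allocate(self, nbytes):
        size = self.bin_size(nbytes)
        if self.free_blocks[size]:
            # Fast path: hand back a block that was freed earlier.
            return self.free_blocks[size].pop()
        # Slow path: fall through to the underlying allocator.
        self.raw_calls += 1
        return self.raw_alloc(size)

    def free(self, block):
        # Keep the block for later instead of releasing it.
        self.free_blocks[len(block)].append(block)


pool = ToyPool(bytearray)
for _ in range(1000):
    buf = pool.allocate(3000)
    pool.free(buf)

print(pool.raw_calls)  # 1: only the first request reached the slow allocator
```

The trade-off is that the pool holds on to memory it is not currently
using, so a real pool also needs a way to release held blocks when
memory gets tight.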

> A third question: What is the best way to contribute
> code/documentation back to PyOpenCL?

Clone the git repo, send a patch to the mailing list. :)

Andreas

_______________________________________________
PyOpenCL mailing list
[email protected]
http://lists.tiker.net/listinfo/pyopencl
