On Sun, 25 Dec 2011 10:50:41 -0800, Lewis Anderson <[email protected]> wrote: > Andreas, > > I think that works. I realized after sending this message that > out-of-order execution is disabled by default, which makes everything > a little simpler. On an unrelated note, I also discovered that > allocation is quite slow! This discovery allowed me to achieve another > 3x improvement.
Alloc speed depends on the implementation obviously, but it's really quite bad on Nvidia. PyOpenCL's memory pools can help with that if you haven't already gone a different route. http://documen.tician.de/pyopencl/tools.html#memory-pools > A third question: What is the best way to contribute > code/documentation back to PyOpenCL? Clone the git repo, send a patch to the mailing list. :) Andreas
pgpKzbFOFuM4j.pgp
Description: PGP signature
_______________________________________________ PyOpenCL mailing list [email protected] http://lists.tiker.net/listinfo/pyopencl
