On Sonntag 22 Juni 2008, Kevin Jacobs <[EMAIL PROTECTED]> wrote: > Thanks for the clarification. That makes perfect sense. Do you have any > feelings on the relative performance of GPUArray versus CUBLAS?
Same. If you check out the past version of PyCuda that still has CUBLAS, there
are files test/test_{cublas,gpuarray}_speed.py. In fact, since CUBLAS does
not implement three-operand "z = x + y", it requires an extra copy that
GPUArray can avoid. If you're into "lies, damned lies and benchmarks", you
could say that GPUArray is actually twice as fast. :)
> The first part of install.rst still says: "This tutorial will walk you
> through the process of building PyUblas."
Oops. Thanks. Fixed.
Andreas
signature.asc
Description: This is a digitally signed message part.
_______________________________________________ Numpy-discussion mailing list [email protected] http://projects.scipy.org/mailman/listinfo/numpy-discussion
