Hey,
After some investigation, it looks like the problem is not with the GEMM
kernel but with the way the kernel is enqueued. It fails when A and B are
associated with the same handle in C = alpha*op(A)*op(A) + beta*C... (this
handle-checking feature is to allow for some optimizations in other
Until this is fixed, I am disabling the use of the generator for GEMM.
2014-07-07 15:00 GMT+02:00 Philippe Tillet phil.til...@gmail.com: