Niketan Pansare created SYSTEMML-1034:
-----------------------------------------

             Summary: Implement solve builtin function using cublas kernels
                 Key: SYSTEMML-1034
                 URL: https://issues.apache.org/jira/browse/SYSTEMML-1034
             Project: SystemML
          Issue Type: Sub-task
          Components: Runtime
            Reporter: Niketan Pansare


1. Extend BinaryOp to enable GPU for solve
2. Add MatrixMatrixBuiltinGPUInstruction and use JCuBlas2's cublasDtrsmBatched 
and cublasDgeqrfBatched (or cublasDgetrfBatched) methods.

For reference implementation, see 
https://github.com/apache/incubator-systemml/blob/master/src/main/java/org/apache/sysml/runtime/matrix/data/LibCommonsMath.java#L97



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to