Niketan Pansare created SYSTEMML-1034:
-----------------------------------------
Summary: Implement solve builtin function using cublas kernels
Key: SYSTEMML-1034
URL: https://issues.apache.org/jira/browse/SYSTEMML-1034
Project: SystemML
Issue Type: Sub-task
Components: Runtime
Reporter: Niketan Pansare
1. Extend BinaryOp to enable GPU execution of solve.
2. Add MatrixMatrixBuiltinGPUInstruction and implement solve using JCublas2's
cublasDtrsmBatched and cublasDgeqrfBatched (or cublasDgetrfBatched) methods.
For the existing CPU reference implementation, see
https://github.com/apache/incubator-systemml/blob/master/src/main/java/org/apache/sysml/runtime/matrix/data/LibCommonsMath.java#L97
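
The factor-then-solve approach in step 2 can be sketched on the CPU as follows. This is only a minimal plain-Java illustration, assuming the LU route: an in-place LU factorization with partial pivoting (the role cublasDgetrfBatched would play), followed by two triangular solves (the role of cublasDtrsmBatched). It makes no GPU or JCublas2 calls, and the class and method names (LuSolveSketch, solve) are hypothetical, not part of SystemML.

```java
import java.util.Arrays;

// Hypothetical CPU-only sketch of "factorize, then triangular solve".
// Not SystemML code and not a JCublas2 call sequence.
final class LuSolveSketch {

    /** Solves A x = b for a square, non-singular A. Inputs are not modified. */
    static double[] solve(double[][] a, double[] b) {
        int n = a.length;
        double[][] lu = new double[n][];
        for (int i = 0; i < n; i++) {
            lu[i] = a[i].clone();
        }
        double[] x = b.clone();

        // Step 1 (role of getrf): LU factorization with partial pivoting.
        // L (unit diagonal) ends up below the diagonal of lu, U on and above it.
        for (int k = 0; k < n; k++) {
            int p = k;
            for (int i = k + 1; i < n; i++) {
                if (Math.abs(lu[i][k]) > Math.abs(lu[p][k])) {
                    p = i;
                }
            }
            if (lu[p][k] == 0.0) {
                throw new ArithmeticException("Matrix is singular");
            }
            // Apply the row swap to both the matrix and the right-hand side.
            double[] tmpRow = lu[k]; lu[k] = lu[p]; lu[p] = tmpRow;
            double tmp = x[k]; x[k] = x[p]; x[p] = tmp;

            for (int i = k + 1; i < n; i++) {
                lu[i][k] /= lu[k][k];
                for (int j = k + 1; j < n; j++) {
                    lu[i][j] -= lu[i][k] * lu[k][j];
                }
            }
        }

        // Step 2 (role of trsm, lower, unit diagonal): solve L y = P b.
        for (int i = 0; i < n; i++) {
            for (int j = 0; j < i; j++) {
                x[i] -= lu[i][j] * x[j];
            }
        }

        // Step 3 (role of trsm, upper, non-unit diagonal): solve U x = y.
        for (int i = n - 1; i >= 0; i--) {
            for (int j = i + 1; j < n; j++) {
                x[i] -= lu[i][j] * x[j];
            }
            x[i] /= lu[i][i];
        }
        return x;
    }

    public static void main(String[] args) {
        // 2x + y = 3 and x + 3y = 5
        double[][] a = {{2, 1}, {1, 3}};
        double[] b = {3, 5};
        System.out.println(Arrays.toString(solve(a, b)));
    }
}
```

A QR-based variant (the cublasDgeqrfBatched option) would replace step 1 with a Householder factorization and additionally apply the transpose of Q to b before the upper-triangular solve.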