Nakul Jindal created SYSTEMML-1436:
--------------------------------------

             Summary: Improve Sparse matrix support for GPU operations
                 Key: SYSTEMML-1436
                 URL: https://issues.apache.org/jira/browse/SYSTEMML-1436
             Project: SystemML
          Issue Type: Task
          Components: Runtime
            Reporter: Nakul Jindal


SystemML has a preliminary set of GPU implementation for its primitive 
operations (Matmult, reductions, neural net operations among others). 
Currently, these GPU operations work when SystemML is run on a single machine 
(either using the Standalone mode or Spark mode). Programs written in the 
external DSLs (DML & PyDML) and internal DSLs (Python and Scala) can enable the 
use of these GPUs.

SystemML is aware of sparsity in matrix blocks and encodes them differently. It 
has 3 different types of Sparse formats (CSR, COO & a custom MCSR). A lot of 
the GPU operations are implemented for dense matrix blocks; for some GPU 
operations, when sparse matrices are encountered, they are first converted to 
dense and then sent to the GPU.

- This project is to implement CUDA kernels for Sparse Matrix blocks
- Operations to be implemented include reductions, element-wise operations, 
neural network operations among others

This project is fairly isolated from the internal compiler & optimizer, 
therefore a thorough knowledge of the entire system will not be needed.

Knowledge of CUDA programming is preferred. 
For a initial implementation, the most efficient CUDA kernel is not required.

Rating - Medium

Mentors - [~nakul02], (optionally [~niketanpansare])



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

Reply via email to