[ 
https://issues.apache.org/jira/browse/FLINK-1873?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15274177#comment-15274177
 ] 

Simone Robutti commented on FLINK-1873:
---------------------------------------

I began working right away on this issue. 

For now I'm focusing on an indexed row matrix format but I will probably 
implement a partitioned format with the same operations to perform some 
operations in a more straightforward way. I will write conversions from one 
format to the other.

For now I'm just initializing the distributed data structure and writing 
conversions to local formats (COO, Sparse, Dense). I'm doing everything with 
the standards of the local linear algebra package (indices as Int, values as 
Doubles, same names for methods and so on). Also I'm working with Flink's 
implementations of all these classes. Is it ok or should I go directly to 
Breeze's implementations?

Then I will start thinking about common operations (multiplication, dot 
product, svd (?), ATA and so on).

> Distributed matrix implementation
> ---------------------------------
>
>                 Key: FLINK-1873
>                 URL: https://issues.apache.org/jira/browse/FLINK-1873
>             Project: Flink
>          Issue Type: New Feature
>          Components: Machine Learning Library
>            Reporter: liaoyuxi
>            Assignee: Simone Robutti
>              Labels: ML
>
> It would help to implement machine learning algorithm more quickly and 
> concise if Flink would provide support for storing data and computation in 
> distributed matrix. The design of the implementation is attached.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to