Dmitriy Lyubimov created MAHOUT-1346:
----------------------------------------

             Summary: Spark Bindings (DRM)
                 Key: MAHOUT-1346
                 URL: https://issues.apache.org/jira/browse/MAHOUT-1346
             Project: Mahout
          Issue Type: Improvement
    Affects Versions: 0.8
            Reporter: Dmitriy Lyubimov
            Assignee: Dmitriy Lyubimov
             Fix For: 0.9


Spark bindings for Mahout DRM. 
DRM DSL. 

Disclaimer. This will all be experimental at this point.

The idea is to wrap DRM by Spark RDD with support of some basic functionality, 
perhaps some humble beginning of Cost-based optimizer 

(0) Spark serialization support for Vector, Matrix 
(1) Bagel transposition 
(2) slim X'X
(2a) not-so-slim X'X
(3) blockify() (compose RDD containing vertical blocks of original input)
(4) read/write Mahout DRM off HDFS
(5) A'B

...



--
This message was sent by Atlassian JIRA
(v6.1#6144)

Reply via email to