[ https://issues.apache.org/jira/browse/SPARK-6100?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14377208#comment-14377208 ]
Xiangrui Meng commented on SPARK-6100:
--------------------------------------
We don't have APIs for distributed matrices in Python yet. Now that MatrixUDT
is merged, it would be nice to create RowMatrix/CoordinateMatrix/BlockMatrix
in Python and use DataFrames for serialization. Do you want to start with that
task?
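
For readers unfamiliar with the three layouts named above, here is a minimal
plain-Python sketch of how the same (i, j, value) entries can be viewed
row-wise or block-wise. It does not use Spark and is not the proposed PySpark
API. The function names are hypothetical, rows are kept with their indices for
clarity, and edge blocks are zero-padded to full size as a simplification.

```python
from collections import defaultdict


def entries_to_indexed_rows(entries, num_cols):
    """Group (i, j, value) entries into {row_index: dense row as a list}."""
    rows = defaultdict(lambda: [0.0] * num_cols)
    for i, j, value in entries:
        rows[i][j] = value
    return dict(rows)


def entries_to_blocks(entries, rows_per_block, cols_per_block):
    """Group (i, j, value) entries into {(block_row, block_col): dense block}.

    Simplification (an assumption of this sketch): every block is allocated
    at full size, so blocks on the matrix edge are zero-padded.
    """
    blocks = {}
    for i, j, value in entries:
        key = (i // rows_per_block, j // cols_per_block)
        if key not in blocks:
            blocks[key] = [[0.0] * cols_per_block for _ in range(rows_per_block)]
        blocks[key][i % rows_per_block][j % cols_per_block] = value
    return blocks


# Coordinate layout: one (row, column, value) tuple per nonzero entry.
entries = [(0, 0, 1.0), (1, 2, 2.0), (3, 1, 3.0)]

indexed_rows = entries_to_indexed_rows(entries, num_cols=3)
blocks = entries_to_blocks(entries, rows_per_block=2, cols_per_block=2)
```

In a distributed setting each entry, row, or block would be one record of a
distributed collection. The dictionaries here only stand in for that.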
> Distributed linear algebra in PySpark/MLlib
> -------------------------------------------
>
> Key: SPARK-6100
> URL: https://issues.apache.org/jira/browse/SPARK-6100
> Project: Spark
> Issue Type: Umbrella
> Components: MLlib, PySpark
> Reporter: Xiangrui Meng
> Assignee: Xiangrui Meng
>
> This is an umbrella JIRA for the Python API of distributed linear algebra in
> MLlib. The goal is to bring the Python API on par with the Scala/Java API. We
> should try wrapping the Scala implementations as much as possible, instead of
> implementing them in Python.