[ 
https://issues.apache.org/jira/browse/SPARK-6192?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14367701#comment-14367701
 ] 

Manoj Kumar commented on SPARK-6192:
------------------------------------

Thanks for your feedback. I've fixed it up (same link) adding an Importance 
section, denoting the importance of the project. Let me know if there is 
anything else to be done.

> Enhance MLlib's Python API (GSoC 2015)
> --------------------------------------
>
>                 Key: SPARK-6192
>                 URL: https://issues.apache.org/jira/browse/SPARK-6192
>             Project: Spark
>          Issue Type: Umbrella
>          Components: ML, MLlib, PySpark
>            Reporter: Xiangrui Meng
>            Assignee: Manoj Kumar
>              Labels: gsoc, gsoc2015, mentor
>
> This is an umbrella JIRA for [~MechCoder]'s GSoC 2015 project. The main theme 
> is to enhance MLlib's Python API, to make it on par with the Scala/Java API. 
> The main tasks are:
> 1. For all models in MLlib, provide save/load method. This also
> includes save/load in Scala.
> 2. Python API for evaluation metrics.
> 3. Python API for streaming ML algorithms.
> 4. Python API for distributed linear algebra.
> 5. Simplify MLLibPythonAPI using DataFrames. Currently, we use
> customized serialization, making MLLibPythonAPI hard to maintain. It
> would be nice to use the DataFrames for serialization.
> I'll link the JIRAs for each of the tasks.
> Note that this doesn't mean all these JIRAs are pre-assigned to [~MechCoder]. 
> The TODO list will be dynamic based on the backlog.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to