[
https://issues.apache.org/jira/browse/SPARK-6192?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14353347#comment-14353347
]
Xiangrui Meng commented on SPARK-6192:
--------------------------------------
[~Manglano] and [~leckie-chn] Thanks for your interests in GSoC & Spark MLlib!
As [~MechCoder] mentioned, this JIRA was created for him based on his past
experience and recent contributions to Spark MLlib. We tried to set a theme for
the project but make the actual tasks flexible. So it doesn't mean that we are
blocking others from implementing these features. You can contribute any of
these features at any time.
It would be great if you can start with some small features or helping review
others' PRs. We need to know each other before we can plan a GSoC project, but
I'm afraid that we may not have enough time to make it happen this year.
Anyway, this is a good place to start:
https://cwiki.apache.org/confluence/display/SPARK/Contributing+to+Spark
> Enhance MLlib's Python API (GSoC 2015)
> --------------------------------------
>
> Key: SPARK-6192
> URL: https://issues.apache.org/jira/browse/SPARK-6192
> Project: Spark
> Issue Type: Umbrella
> Components: ML, MLlib, PySpark
> Reporter: Xiangrui Meng
> Assignee: Manoj Kumar
> Labels: gsoc, gsoc2015, mentor
>
> This is an umbrella JIRA for [~MechCoder]'s GSoC 2015 project. The main theme
> is to enhance MLlib's Python API, to make it on par with the Scala/Java API.
> The main tasks are:
> 1. For all models in MLlib, provide save/load method. This also
> includes save/load in Scala.
> 2. Python API for evaluation metrics.
> 3. Python API for streaming ML algorithms.
> 4. Python API for distributed linear algebra.
> 5. Simplify MLLibPythonAPI using DataFrames. Currently, we use
> customized serialization, making MLLibPythonAPI hard to maintain. It
> would be nice to use the DataFrames for serialization.
> I'll link the JIRAs for each of the tasks.
> Note that this doesn't mean all these JIRAs are pre-assigned to [~MechCoder].
> The TODO list will be dynamic based on the backlog.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]