[jira] [Commented] (SPARK-15069) GSoC 2016: Exposing more R and Python APIs for MLlib

2016-06-23 Thread Kai Jiang (JIRA)

[ 
https://issues.apache.org/jira/browse/SPARK-15069?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15347121#comment-15347121
 ] 

Kai Jiang commented on SPARK-15069:
---

*6/22/2016 - Week 5*
To-do items
* Keep investigating the differences between the MLlib API and the R API for Decision Tree
* Start the same investigation for Random Forest
* Continue updating the Decision Tree wrapper PR based on the findings

> GSoC 2016: Exposing more R and Python APIs for MLlib
> 
>
> Key: SPARK-15069
> URL: https://issues.apache.org/jira/browse/SPARK-15069
> Project: Spark
> Issue Type: Umbrella
> Components: ML, PySpark, SparkR
> Reporter: Joseph K. Bradley
> Assignee: Kai Jiang
> Labels: gsoc2016, mentor
> Attachments: 1458791046_[GSoC2016]ApacheSpark_KaiJiang_Proposal.pdf
>
>
> This issue is for tracking the Google Summer of Code 2016 project for Kai 
> Jiang: "Apache Spark: Exposing more R and Python APIs for MLlib"
> See attached proposal for details.  Note that the tasks listed in the 
> proposal are tentative and can adapt as the community works on these various 
> parts of MLlib.
> This umbrella will contain links for tasks included in this project, to be 
> added as each task begins.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Commented] (SPARK-15069) GSoC 2016: Exposing more R and Python APIs for MLlib

2016-06-16 Thread Joseph K. Bradley (JIRA)

[ 
https://issues.apache.org/jira/browse/SPARK-15069?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15334622#comment-15334622
 ] 

Joseph K. Bradley commented on SPARK-15069:
---

h4. 6/16/2016 - Week 4

To-do items
* Continuation of doc items: [SPARK-15672]
* Decision tree API [SPARK-15767] -> I'll add notes to this JIRA
* If there is time, begin work on forests or boosting.





[jira] [Commented] (SPARK-15069) GSoC 2016: Exposing more R and Python APIs for MLlib

2016-06-06 Thread Joseph K. Bradley (JIRA)

[ 
https://issues.apache.org/jira/browse/SPARK-15069?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15317151#comment-15317151
 ] 

Joseph K. Bradley commented on SPARK-15069:
---

h4. 6/6/2016 - Week 3

To-do items
* Continuation of items from previous week
* If there is time, start consideration of random forests + boosting.
* JIRA for tree API: [SPARK-15767]





[jira] [Commented] (SPARK-15069) GSoC 2016: Exposing more R and Python APIs for MLlib

2016-05-31 Thread Joseph K. Bradley (JIRA)

[ 
https://issues.apache.org/jira/browse/SPARK-15069?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15308552#comment-15308552
 ] 

Joseph K. Bradley commented on SPARK-15069:
---

h4. 5/31/2016 - Week 2

To-do items
* Send minor PR to link R/DOCUMENTATION.md from R/README.md
* Help with R programming guide update: [SPARK-15672]
* Decision Tree API for SparkR
** Create JIRA
** Propose an API, referencing the MLlib API and existing R libraries. This could be done on the JIRA or in a linked doc
** Create MVP based on existing MLlib APIs
** Later, we can add more functionality, such as viewing the structure of the 
tree from R





[jira] [Commented] (SPARK-15069) GSoC 2016: Exposing more R and Python APIs for MLlib

2016-05-26 Thread Joseph K. Bradley (JIRA)

[ 
https://issues.apache.org/jira/browse/SPARK-15069?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15302699#comment-15302699
 ] 

Joseph K. Bradley commented on SPARK-15069:
---

h3. 5/23/2016 - Week 1
Initial items
* QA
** (SPARK-15490)
** Ping about helping with (SPARK-14809)
* Ping on (SPARK-15439)
* If time permits, begin work on Decision Tree API for SparkR
** Create JIRA
** Propose API
** Prototype locally

