GitHub user manishamde opened a pull request:

    https://github.com/apache/spark/pull/402

    Decision Tree documentation for MLlib programming guide

    Added documentation on using the decision tree algorithms for 
classification and regression in the Spark 1.0 release.
    
    Apart from a general review, I need specific input on the following:
    * I had to move a lot of the existing documentation under the *linear 
methods* umbrella to accommodate decision trees. I wonder if there is a better 
way to organize the programming guide given we are so close to the release.
    * I have not looked closely at pyspark, but I am wondering whether new 
mllib algorithms are automatically plugged in or whether we need to do some 
extra work to call mllib functions from pyspark. I will add to the pyspark 
examples based on the advice I get.
    
    cc: @mengxr, @hirakendu, @etrain, @atalwalkar

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/manishamde/spark tree_doc

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/spark/pull/402.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #402
    
----
commit d06511dbde8d85d8caa8294aff3bd86094f14923
Author: Manish Amde <[email protected]>
Date:   2014-04-13T03:26:33Z

    basic skeleton

commit 1537dd372a3e64251923792d9cc911dff12ed85f
Author: Manish Amde <[email protected]>
Date:   2014-04-13T05:32:24Z

    added placeholders and some doc

commit 3ecb2ad8a0a004a89debe5a5ce5b0d72181b9305
Author: Manish Amde <[email protected]>
Date:   2014-04-13T05:44:42Z

    minor text addition

commit b93125cce2d8a6a685aac0e7cda2357df0ddb09b
Author: Manish Amde <[email protected]>
Date:   2014-04-13T18:09:34Z

    more subsection reorg

commit 94fd2f9cf6aaf8c99f5ebc76d207a70abc82f832
Author: Manish Amde <[email protected]>
Date:   2014-04-13T18:38:42Z

    more reorg

commit 69252752172ec173bc9e705cf2e9194a83c46f9a
Author: Manish Amde <[email protected]>
Date:   2014-04-13T19:59:35Z

    impurity and information gain

commit 9c0c4be00ecf04ee42926ca11f99b07c1c1ab0c6
Author: Manish Amde <[email protected]>
Date:   2014-04-13T21:11:37Z

    split candidate

commit f427e84dabf9e1bdf95b65d258bedc5c85bb7577
Author: Manish Amde <[email protected]>
Date:   2014-04-13T23:42:13Z

    renaming sections

commit 6e297d7cc0b04c5586bf80c0be7b3de69c1a606a
Author: Manish Amde <[email protected]>
Date:   2014-04-14T00:26:43Z

    added subsections

commit b9ef6c4088429efeed8ddb63e70d6ee6516443df
Author: Manish Amde <[email protected]>
Date:   2014-04-14T01:14:38Z

    basic decision tree code examples

commit dbb0e5e4c3e5bfc14da1ad76e8a8f7ec2a2c0ac8
Author: Manish Amde <[email protected]>
Date:   2014-04-14T01:26:13Z

    minor improvements to text

commit 865826ee04da10260aeb1db72f1da13f730e678d
Author: Manish Amde <[email protected]>
Date:   2014-04-14T01:37:37Z

    minor: grammar

commit 022485ad965a75bd48aadd3852850fc2a0c9d5c6
Author: Manish Amde <[email protected]>
Date:   2014-04-14T05:40:22Z

    more documentation

----


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---
