[GitHub] spark pull request: MLI-1 Decision Trees

2014-03-12 Thread manishamde
Github user manishamde commented on a diff in the pull request: https://github.com/apache/spark/pull/79#discussion_r10531612 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/tree/DecisionTree.scala --- @@ -0,0 +1,1055 @@ +/* + * Licensed to the Apache Software

[GitHub] spark pull request: MLI-1 Decision Trees

2014-03-10 Thread manishamde
Github user manishamde commented on a diff in the pull request: https://github.com/apache/spark/pull/79#discussion_r10461572 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/tree/model/DecisionTreeModel.scala --- @@ -0,0 +1,58 @@ +/* + * Licensed to the Apache

[GitHub] spark pull request: MLI-1 Decision Trees

2014-03-10 Thread manishamde
Github user manishamde commented on a diff in the pull request: https://github.com/apache/spark/pull/79#discussion_r10461542 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/tree/DecisionTree.scala --- @@ -0,0 +1,1055 @@ +/* + * Licensed to the Apache Software

[GitHub] spark pull request: MLI-1 Decision Trees

2014-03-10 Thread manishamde
Github user manishamde commented on a diff in the pull request: https://github.com/apache/spark/pull/79#discussion_r10461519 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/tree/DecisionTree.scala --- @@ -0,0 +1,1055 @@ +/* + * Licensed to the Apache Software

[GitHub] spark pull request: MLI-1 Decision Trees

2014-03-10 Thread manishamde
Github user manishamde commented on the pull request: https://github.com/apache/spark/pull/79#issuecomment-37262156 @mengxr Thanks for such a detailed review. The code is already in much better shape after incorporating your suggested changes. I have fixed the straightforward code

[GitHub] spark pull request: MLI-1 Decision Trees

2014-03-10 Thread manishamde
Github user manishamde commented on the pull request: https://github.com/apache/spark/pull/79#issuecomment-37213685 Thanks for another round of reviews @mengxr. I will fix these style issues, answer your comments and add more documentation around the optimizations later today

[GitHub] spark pull request: MLI-1 Decision Trees

2014-03-06 Thread manishamde
Github user manishamde commented on the pull request: https://github.com/apache/spark/pull/79#issuecomment-36973041 @mengxr @hsaputra Thanks for the code style comments. I have put a lot of effort into documenting the code. I guess I still need to make the code consistent with the Spark

[GitHub] spark pull request: MLI-1 Decision Trees

2014-03-05 Thread manishamde
Github user manishamde commented on the pull request: https://github.com/apache/spark/pull/79#issuecomment-36755664 Thanks Sean. Multi-class classification and feature importances are important features that will be added soon. We implemented a minimal feature set since we

[GitHub] spark pull request: MLI-1 Decision Trees

2014-03-04 Thread manishamde
GitHub user manishamde opened a pull request: https://github.com/apache/spark/pull/79 MLI-1 Decision Trees Joint work with @hirakendu, @etrain, @atalwalkar and @harsha2010. Key features: + Supports binary classification and regression + Supports Gini, entropy and