[
https://issues.apache.org/jira/browse/SPARK-10232?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Joseph K. Bradley updated SPARK-10232:
--------------------------------------
Attachment: GBT.png, RandomForest.png
> Decide whether spark.ml Decision Tree and Random Forest can replace
> spark.mllib implementation
> ----------------------------------------------------------------------------------------------
>
> Key: SPARK-10232
> URL: https://issues.apache.org/jira/browse/SPARK-10232
> Project: Spark
> Issue Type: Task
> Components: ML, MLlib
> Reporter: Joseph K. Bradley
> Assignee: Joseph K. Bradley
> Attachments: GBT.png, RandomForest.png
>
>
> This JIRA is for discussing whether to replace the spark.mllib DecisionTree
> and RandomForest implementations with the implementation in spark.ml. The
> spark.ml implementation is simply a copy of the spark.mllib one, with slight
> modifications (removing "bins").
> Pros:
> * Only one implementation to support.
> * Efficiency gains in spark.ml will benefit both APIs.
> Cons:
> * As spark.ml tree functionality increases, we will need to maintain
> conversion code for converting spark.ml trees to spark.mllib trees.
> Must:
> * Ensure we do not have significant regressions in the new implementation.