[jira] [Commented] (SPARK-3042) DecisionTree filtering is very inefficient

Apache Spark (JIRA) Fri, 15 Aug 2014 15:43:42 -0700

    [ 
https://issues.apache.org/jira/browse/SPARK-3042?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14099303#comment-14099303
 ]


Apache Spark commented on SPARK-3042:
-------------------------------------

User 'jkbradley' has created a pull request for this issue:
https://github.com/apache/spark/pull/1975

> DecisionTree filtering is very inefficient
> ------------------------------------------
>
>                 Key: SPARK-3042
>                 URL: https://issues.apache.org/jira/browse/SPARK-3042
>             Project: Spark
>          Issue Type: Improvement
>          Components: MLlib
>            Reporter: Joseph K. Bradley
>            Assignee: Joseph K. Bradley
>
> DecisionTree needs to match each example to a node at each iteration.  It 
> currently does this with a set of filters very inefficiently: For each 
> example, it examines each node at the current level and traces up to the root 
> to see if that example should be handled by that node.
> Proposed fix: Filter top-down using the partly built tree itself.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[jira] [Commented] (SPARK-3042) DecisionTree filtering is very inefficient

Reply via email to