[ https://issues.apache.org/jira/browse/SPARK-3042?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14099303#comment-14099303 ]
Apache Spark commented on SPARK-3042:
-------------------------------------

User 'jkbradley' has created a pull request for this issue:
https://github.com/apache/spark/pull/1975

> DecisionTree filtering is very inefficient
> ------------------------------------------
>
>                 Key: SPARK-3042
>                 URL: https://issues.apache.org/jira/browse/SPARK-3042
>             Project: Spark
>          Issue Type: Improvement
>          Components: MLlib
>            Reporter: Joseph K. Bradley
>            Assignee: Joseph K. Bradley
>
> DecisionTree needs to match each example to a node at each iteration. It
> currently does this with a set of filters very inefficiently: For each
> example, it examines each node at the current level and traces up to the root
> to see if that example should be handled by that node.
> Proposed fix: Filter top-down using the partly built tree itself.
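
The top-down filtering proposed in the issue description can be illustrated with a minimal sketch. This is not the MLlib code or the contents of the pull request; the `Node` class, its fields, and `assign_top_down` are hypothetical names chosen for illustration, and a binary tree with numeric threshold splits is assumed. Each example walks down from the root through the splits chosen so far and stops at the first unsplit node, which costs one comparison per level. This replaces checking every node at the current level and tracing each one back to the root.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Node:
    """Hypothetical node of a partly built binary decision tree."""
    node_id: int
    feature: Optional[int] = None  # None while the node is still unsplit
    threshold: float = 0.0
    left: Optional["Node"] = None
    right: Optional["Node"] = None


def assign_top_down(root: Node, features: List[float]) -> int:
    """Follow the splits chosen so far; return the id of the unsplit node reached."""
    node = root
    while node.feature is not None:
        # Assumed convention: values <= threshold go left, others go right.
        if features[node.feature] <= node.threshold:
            node = node.left
        else:
            node = node.right
    return node.node_id


# Root splits on feature 0 at 0.5; its left child splits on feature 1 at 2.0.
# Nodes 3, 4 and 5 are the unsplit nodes of the current iteration.
tree = Node(1, feature=0, threshold=0.5,
            left=Node(2, feature=1, threshold=2.0, left=Node(4), right=Node(5)),
            right=Node(3))

print(assign_top_down(tree, [0.2, 3.0]))
```

In this sketch the work per example grows with the depth of the tree rather than with the number of nodes at the current level.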