Joseph K. Bradley created SPARK-3042:
----------------------------------------
Summary: DecisionTree filtering is very inefficient
Key: SPARK-3042
URL: https://issues.apache.org/jira/browse/SPARK-3042
Project: Spark
Issue Type: Improvement
Components: MLlib
Reporter: Joseph K. Bradley
DecisionTree needs to match each example to a node at each iteration. It
currently does this very inefficiently, using a set of filters: for each
example, it examines each node at the current level and traces up to the root
to see if that example should be handled by that node. The work per example
therefore grows with the number of nodes at the current level.
Proposed fix: Filter top-down, using the partly built tree itself to route
each example from the root down to its node.
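
To make the difference concrete, here is a minimal Python sketch of both strategies. It is an illustration only, not the MLlib code: the heap-style node numbering (children of node i are 2i+1 and 2i+2), the `splits` dictionary, the "go left if value <= threshold" rule, and both function names are assumptions made up for this example.

```python
from typing import Dict, Sequence, Tuple

# Hypothetical representation: (feature index, threshold).
# An example goes left when x[feature] <= threshold.
Split = Tuple[int, float]


def node_index_by_filters(splits: Dict[int, Split],
                          level_nodes: Sequence[int],
                          x: Sequence[float]) -> int:
    """Bottom-up matching: try every node at the current level and
    trace up to the root to check whether x belongs to it."""
    for candidate in level_nodes:
        idx = candidate
        matches = True
        while idx > 0:
            parent = (idx - 1) // 2
            feature, threshold = splits[parent]
            went_left = x[feature] <= threshold
            is_left_child = idx == 2 * parent + 1
            if went_left != is_left_child:
                matches = False
                break
            idx = parent
        if matches:
            return candidate
    raise ValueError("example does not reach any node at this level")


def node_index_top_down(splits: Dict[int, Split],
                        x: Sequence[float]) -> int:
    """Top-down matching: follow the partly built tree from the root
    until reaching a node that has not been split yet."""
    idx = 0
    while idx in splits:
        feature, threshold = splits[idx]
        idx = 2 * idx + 1 if x[feature] <= threshold else 2 * idx + 2
    return idx


# Partly built tree: the root (0) and its children (1, 2) are split;
# nodes 3..6 form the current level.
splits = {0: (0, 0.5), 1: (1, 2.0), 2: (1, 5.0)}
example = [0.7, 3.0]
found_by_filters = node_index_by_filters(splits, [3, 4, 5, 6], example)
found_top_down = node_index_top_down(splits, example)
```

Under these assumptions both functions agree on the node, but the top-down version visits one node per tree level, while the filter version may visit every node at the current level and walk toward the root for each one.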