[GitHub] [spark] 137alpha commented on pull request #32813: [SPARK-34591][MLLIB] Disable decision tree pruning

GitBox Tue, 08 Jun 2021 14:04:26 -0700


137alpha commented on pull request #32813:
URL: https://github.com/apache/spark/pull/32813#issuecomment-857145870



   @srowen 
   
   > I'm still kind of curious how this happens. Is the pruning logic just not 
correct? Or do you have a sense of what the tree is like before and after pruning?
   
   The pruning logic is implemented correctly, in that it merges nodes on the 
basis of their class predictions, but it is a bad idea for the reasons given 
above: it compares only the predicted class of each node, not the predicted 
probability.
   
   The logic could be fixed as follows: merge two nodes only if the 
probabilities predicted by both nodes are identical.
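
   A minimal sketch of the difference, in Python rather than Spark's Scala. 
The `Node` class and the `prune`, `same_class` and `same_probs` helpers are 
hypothetical names made up for illustration and are not Spark's actual 
implementation. It assumes each node stores a class probability vector and 
that "identical" means exact equality.

```python
from dataclasses import dataclass
from typing import Callable, Optional, Tuple


@dataclass
class Node:
    # Hypothetical tree node for illustration only (not a Spark class).
    probs: Tuple[float, ...]          # class probabilities at this node
    left: Optional["Node"] = None
    right: Optional["Node"] = None

    @property
    def is_leaf(self) -> bool:
        return self.left is None and self.right is None

    @property
    def predicted_class(self) -> int:
        # Index of the most probable class.
        return max(range(len(self.probs)), key=self.probs.__getitem__)


def prune(node: Node, same: Callable[[Node, Node], bool]) -> Node:
    """Bottom-up pruning: collapse a split into a leaf when both children
    are leaves and `same(left, right)` holds."""
    if node.is_leaf:
        return node
    left = prune(node.left, same)
    right = prune(node.right, same)
    if left.is_leaf and right.is_leaf and same(left, right):
        return Node(probs=node.probs)
    return Node(node.probs, left, right)


def same_class(a: Node, b: Node) -> bool:
    # Criterion criticized above: compares only the predicted class.
    return a.predicted_class == b.predicted_class


def same_probs(a: Node, b: Node) -> bool:
    # Proposed criterion: merge only when the probabilities are identical.
    return a.probs == b.probs


# Both children predict class 1, but with different probabilities.
tree = Node((0.25, 0.75), Node((0.4, 0.6)), Node((0.1, 0.9)))
by_class = prune(tree, same_class)   # split is collapsed, probabilities lost
by_probs = prune(tree, same_probs)   # split is kept
```

   With the class-based check the two leaves are merged even though they 
predict different probabilities; with the probability-based check the split 
survives, and only children with identical probabilities would be merged.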


