zhengruifeng edited a comment on pull request #31090: URL: https://github.com/apache/spark/pull/31090#issuecomment-759178976
@srowen reasonable. I just create a `RandomForestClassificationModel` with numTrees=100 and depth=20, then find that the model size is 226M. So I think for RF and GBT, we should keep current behavior. But for a DecisionTree, whose size is definitely small enough (I also create a decision tree with depth=30, its size is 3.9M), I think it is safe to use single partition. ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
