[GitHub] [spark] zhengruifeng edited a comment on pull request #31090: [SPARK-34047][ML] save decisiontree model in single partition

GitBox Tue, 12 Jan 2021 19:31:24 -0800


zhengruifeng edited a comment on pull request #31090:
URL: https://github.com/apache/spark/pull/31090#issuecomment-759178976



   @srowen reasonable.
   I just create a `RandomForestClassificationModel` with numTrees=100 and 
depth=20, then find that the model size is 226M. So I think for RF and GBT, we 
should keep current behavior.
   But for a DecisionTree, whose size is definitely small enough (I also create 
a decision tree with depth=30, its size is 3.9M), I think it is safe to use 
single partition.


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]



---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[GitHub] [spark] zhengruifeng edited a comment on pull request #31090: [SPARK-34047][ML] save decisiontree model in single partition

Reply via email to