Hi all, I'm using Spark 2.0.0 to train a model with 10 million+ parameters on about 500 GB of data. I use treeAggregate to aggregate the gradient. With depth = 2 or 3 it works, and depth = 3 is faster, so I set depth = 4 hoping for even better performance, but now some executors hit OOM during the shuffle phase. Why does this happen? With a deeper tree, each executor should aggregate fewer records and therefore use less memory, so I don't understand where the OOM comes from. Can someone help?
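For context, here is a rough Python simulation of how Spark 2.0's `RDD.treeAggregate` picks the intermediate partition counts, mirroring (to the best of my reading) the `scale` computation and reduction loop in that version's source. The input partition count of 1000 is a made-up example, not my actual job:

```python
import math

def tree_aggregate_levels(num_partitions, depth):
    """Approximate the partition count after each intermediate shuffle
    level of Spark 2.0's RDD.treeAggregate (a sketch of its logic, not
    the real implementation)."""
    # Spark computes scale = max(ceil(numPartitions ** (1/depth)), 2)
    scale = max(math.ceil(num_partitions ** (1.0 / depth)), 2)
    levels = []
    # Spark stops adding levels once another one wouldn't help:
    # while numPartitions > scale + ceil(numPartitions / scale)
    while num_partitions > scale + math.ceil(num_partitions / scale):
        num_partitions //= scale  # integer division, as in Spark
        levels.append(num_partitions)
    return levels

# Hypothetical example: an RDD with 1000 input partitions.
for d in (2, 3, 4):
    print("depth =", d, "->", tree_aggregate_levels(1000, d))
```

With 1000 partitions this prints one intermediate level for depth = 2, two for depth = 3, and three for depth = 4, so a deeper tree adds extra shuffle rounds whose tasks each combine several large gradient vectors.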