Any way to monitor the shuffle size?

Wenlei Xie Mon, 11 Nov 2013 00:28:29 -0800

Hi,

I have a shuffle task whose data is expected to contain many repeated
values, so I assumed that shuffle compression would improve performance.

However, I get very similar running times whether I set
spark.shuffle.compress to true or false. I would like to know whether this
is because my data does not compress well. Is there any way to monitor
the amount of data shuffled for one transformation?
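
A rough way to look at the compressibility half of this question outside
Spark is to serialize a small sample of the records and compare the raw and
compressed sizes. The sketch below is only an approximation under stated
assumptions: it uses Python's pickle and zlib rather than whatever
serializer and codec the Spark job uses, and the sample records and the
compression_ratio helper are hypothetical names made up for illustration.

```python
import pickle
import zlib


def compression_ratio(records):
    """Return compressed size divided by raw size for a pickled sample."""
    # Serialize the sample; this stands in for the job's real serializer.
    raw = pickle.dumps(records, protocol=pickle.HIGHEST_PROTOCOL)
    # Compress with zlib; this stands in for the job's real shuffle codec.
    compressed = zlib.compress(raw)
    return len(compressed) / len(raw)


# Hypothetical sample: only 10 distinct keys across 10000 records.
sample = [("key-%d" % (i % 10), 1) for i in range(10000)]
ratio = compression_ratio(sample)
print("compressed/raw = %.3f" % ratio)
```

A ratio close to 1 would suggest that the data itself does not compress
well; a much smaller ratio would suggest looking elsewhere for the reason
the running time does not change.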

Best,
Wenlei
