Re: Driver staggering task launch times

Ara Vartanian Thu, 13 Aug 2015 23:04:11 -0700
I’m running on Yarn.

> On Aug 13, 2015, at 10:58 PM, Philip Weaver <philip.wea...@gmail.com> wrote:
> 
> Are you running on mesos, yarn or standalone? If you're on mesos, are you 
> using coarse grain or fine grained mode?
> 
> On Thu, Aug 13, 2015 at 10:13 PM, Ara Vartanian <arav...@cs.wisc.edu 
> <mailto:arav...@cs.wisc.edu>> wrote:
> I’m observing an unusual situation where my step duration increases as I add 
> further executors to my cluster. My algorithm is fully data parallelizable 
> into a map phase, followed by a reduce step at the end that amounts to matrix 
> addition. So I’ve kicked a cluster of, say, 100 executors with 4 cores per 
> executor and before running the algorithm I’ve repartitioned the RDD into 400 
> partitions. I can see in the Spark UI that each of the 400 (map) tasks takes 
> about 2 seconds. However, the entire step is taking over a minute, and this 
> is because the launch times of the tasks as reported in the Spark UI are 
> staggered. For example, the first 100 might be launched in the same second, 
> then another group 3 seconds later, and so forth (with the durations slowly 
> expanding). With a task time of 2 seconds, this “launch lag” is dominating 
> the computation time and only gets worse as I add nodes.
> 
> Any insight on how I could go about diagnosing this would be greatly 
> appreciated.
> 
> 
> 
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: user-unsubscr...@spark.apache.org 
> <mailto:user-unsubscr...@spark.apache.org>
> For additional commands, e-mail: user-h...@spark.apache.org 
> <mailto:user-h...@spark.apache.org>
> 
>
Re: Driver staggering task launch times

Reply via email to