Re: Spark app gets slower as it gets executed more times

2014-02-06 Thread Aaron Davidson
Sorry for delay, by long-running I just meant if you were running an iterative algorithm that was slowing down over time. We have observed this in the spark-perf benchmark; as file system state builds up, the job can slow down. Once the job finishes, however, it is cleaned up and should not affect

Re: Spark app gets slower as it gets executed more times

2014-02-04 Thread Aureliano Buendia
On Mon, Feb 3, 2014 at 12:26 AM, Aaron Davidson wrote: > Are you seeing any exceptions in between running apps? Does restarting the > master/workers actually cause Spark to speed back up again? It's possible, > for instance, that you run out of disk space, which should cause exceptions > but not

Re: Spark app gets slower as it gets executed more times

2014-02-02 Thread Aaron Davidson
Are you seeing any exceptions in between running apps? Does restarting the master/workers actually cause Spark to speed back up again? It's possible, for instance, that you run out of disk space, which should cause exceptions but not go away by restarting the master/workers. One thing to worry abo

Re: Spark app gets slower as it gets executed more times

2014-02-01 Thread 尹绪森
Is your spark app an iterative one ? If so, your app is creating a big DAG in every iteration. You should use checkpoint it periodically, say, 10 iterations one checkpoint. 2014-02-01 Aureliano Buendia : > Hi, > > I've noticed my spark app (on ec2) gets slower and slower as I repeatedly > execut

Spark app gets slower as it gets executed more times

2014-01-31 Thread Aureliano Buendia
Hi, I've noticed my spark app (on ec2) gets slower and slower as I repeatedly execute it. With a fresh ec2 cluster, it is snappy and takes about 15 mins to complete, after running the same app 4 times it gets slower and takes to 40 mins and more. While the cluster gets slower, the monitoring met