Our Spark cluster is configured to write application history event logs to a directory on HDFS. This all works fine (I've tested it with the Spark shell).
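
For reference, the relevant settings look roughly like this (the HDFS path is a placeholder, not our actual directory):

    # spark-defaults.conf -- event logging for application history
    spark.eventLog.enabled   true
    spark.eventLog.dir       hdfs:///shared/spark-event-logs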

However, on a large, long-running job that we ran tonight, one of our machines at the cloud provider had issues and had to be terminated and replaced in the middle of the job.

The job completed correctly, and it shows as FINISHED in the "Completed Applications" section of the Spark web UI. However, when I try to look at the application's history, the UI says "Application history not found" and "Application ... is still in progress".

The cause appears to be the terminated machine: when I click on the executor list for that job, Spark still shows the executor from that machine in state RUNNING.
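
One thing I plan to check, assuming Spark 1.3 writes each application's event log as a single file that keeps an ".inprogress" suffix until the application-end event is recorded (the path and application ID below are placeholders):

    // Run from spark-shell: list the event-log files for the affected application.
    import org.apache.hadoop.fs.{FileSystem, Path}
    val fs = FileSystem.get(sc.hadoopConfiguration)
    val names = fs.listStatus(new Path("hdfs:///shared/spark-event-logs")).map(_.getPath.getName)
    names.filter(_.contains("app-20150422")).foreach(println)   // hypothetical application ID
    // If the file still ends in ".inprogress", my guess is that the Master never
    // saw the application-end event, which would match the "still in progress"
    // message. Renaming the file to drop the suffix might be a workaround, but
    // I haven't tried it:
    // fs.rename(new Path(".../app-20150422.inprogress"), new Path(".../app-20150422"))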

Is there a solution or workaround for this? BTW, I'm running Spark v1.3.0.

Thanks,

DR
