Saisai Shao created SPARK-10739:
-----------------------------------
Summary: Add attempt window for long running Spark application on
Yarn
Key: SPARK-10739
URL: https://issues.apache.org/jira/browse/SPARK-10739
Project: Spark
Issue Type: Improvement
Components: YARN
Reporter: Saisai Shao
Priority: Minor
Currently Spark on Yarn uses max attempts to control the failure number, if
application's failure number reaches to the max attempts, application will not
be recovered by RM, it is not very for long running applications, since it will
easily exceed the max number, also setting a very large max attempts will hide
the real problem.
So here introduce an attempt window to control the application attempt times,
this will ignore the out of window attempts, it is introduced in Hadoop 2.6+ to
support long running application, it is quite useful for Spark Streaming, Spark
shell like applications.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]