[ 
https://issues.apache.org/jira/browse/YARN-2266?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14057127#comment-14057127
 ] 

Ashutosh Jindal commented on YARN-2266:
---------------------------------------

bq.What happens next? The apps are going to be resubmitted and they will still 
wait in the queue.
No, the same application will not be submitted again. Consider a case where an 
application runs periodically every hour and the average completion time is 
30 mins. In such a case, if the application gets no resources for 30 mins, or 
gets them only after 30 mins, it is better to kill the application and let the 
next scheduled application serve the purpose.
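
To make the idea concrete, here is a rough sketch of the bookkeeping such a timeout service might do. It is only an illustration and not a patch: the class PendingAppTimeoutSketch and its methods are hypothetical names, it does not use any RM or scheduler API, and it assumes the caller supplies timestamps in milliseconds and actually performs the kill for the returned application ids.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical sketch: tracks applications that are still waiting for AM
// resources and reports the ones that have waited at least the timeout.
public class PendingAppTimeoutSketch {
    // appId -> time (ms) at which the app started waiting for its AM container
    private final Map<String, Long> pendingSince = new LinkedHashMap<>();
    private final long timeoutMs;

    public PendingAppTimeoutSketch(long timeoutMs) {
        this.timeoutMs = timeoutMs;
    }

    // Called when an application is submitted and starts waiting.
    public void onSubmitted(String appId, long nowMs) {
        pendingSince.put(appId, nowMs);
    }

    // Called when the AM container is allocated; the app is no longer pending.
    public void onAmAllocated(String appId) {
        pendingSince.remove(appId);
    }

    // Returns the ids of apps that have waited at least timeoutMs and stops
    // tracking them. The caller is assumed to kill or fail these apps.
    public List<String> expire(long nowMs) {
        List<String> expired = new ArrayList<>();
        pendingSince.entrySet().removeIf(e -> {
            if (nowMs - e.getValue() >= timeoutMs) {
                expired.add(e.getKey());
                return true;
            }
            return false;
        });
        return expired;
    }
}
```

For the hourly job above, the timeout would be set to 30 mins, and a periodic monitor thread would call expire() with the current time and kill whatever it returns.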

> Add an application timeout service in RM to kill applications which are not 
> getting resources
> ---------------------------------------------------------------------------------------------
>
>                 Key: YARN-2266
>                 URL: https://issues.apache.org/jira/browse/YARN-2266
>             Project: Hadoop YARN
>          Issue Type: Improvement
>          Components: resourcemanager
>            Reporter: Ashutosh Jindal
>
> Currently, if an application is submitted to the RM, the app keeps waiting until 
> resources are allocated for its AM. Such an application may be stuck until a 
> resource is allocated for the AM, which may be due to over-utilization of 
> Queue or User limits, etc. In a production cluster, some periodically running 
> applications may have a smaller cluster share. So after waiting for some time, 
> if resources are not available, such applications can be marked as failed.



--
This message was sent by Atlassian JIRA
(v6.2#6252)
