[ https://issues.apache.org/jira/browse/TEZ-2300?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14702216#comment-14702216 ]

Bikas Saha commented on TEZ-2300:
---------------------------------

The current patch is useful because it ensures that the app is killed once a 
maximum deadline has passed.

In addition to that, if we want to ensure ATS is flushed by keeping the AM 
alive, we could do the following in shutdownTezAM:
1) Send a release-containers signal to the scheduler (this will reduce resource 
usage).
2) Ensure DAG kill is initiated (this may already be happening, but Rohini 
mentioned she saw allocations happen during this time).
3) Call stop() to stop asynchronously (this includes the flush to ATS).
Then return.
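
A rough sketch of that ordering, just to make the proposal concrete. The 
interfaces and method names below (TaskScheduler, DagManager, StoppableService, 
ShutdownSketch) are hypothetical stand-ins and not actual Tez classes. The only 
thing it tries to show is: release containers, kill the DAG if one is running, 
then kick off stop() on another thread and return.

```java
import java.util.concurrent.CompletableFuture;

// Hypothetical stand-ins for AM components; these are NOT Tez APIs.
interface TaskScheduler {
    void releaseAllContainers();
}

interface DagManager {
    boolean isDagRunning();
    void killDag();
}

interface StoppableService {
    void stop(); // assumed to include the flush to ATS
}

final class ShutdownSketch {
    private final TaskScheduler scheduler;
    private final DagManager dags;
    private final StoppableService am;

    ShutdownSketch(TaskScheduler scheduler, DagManager dags, StoppableService am) {
        this.scheduler = scheduler;
        this.dags = dags;
        this.am = am;
    }

    /**
     * Runs steps 1 and 2 on the calling thread, then starts stop() on
     * another thread and returns without waiting for it to finish.
     */
    CompletableFuture<Void> shutdown() {
        // 1) Give containers back first so resource usage drops right away.
        scheduler.releaseAllContainers();

        // 2) Make sure the DAG kill has been initiated so no new allocations happen.
        if (dags.isDagRunning()) {
            dags.killDag();
        }

        // 3) Stop asynchronously; the caller gets a future and can return.
        return CompletableFuture.runAsync(am::stop);
    }
}
```

The returned future is only there so a caller (or the max-deadline logic from 
the current patch) can observe completion. runAsync uses the common pool here 
for brevity; a real AM would presumably want its own non-daemon thread so the 
flush is not cut short.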

Thoughts?


> TezClient.stop() takes a lot of time or does not work sometimes
> ---------------------------------------------------------------
>
>                 Key: TEZ-2300
>                 URL: https://issues.apache.org/jira/browse/TEZ-2300
>             Project: Apache Tez
>          Issue Type: Bug
>            Reporter: Rohini Palaniswamy
>            Assignee: Jonathan Eagles
>         Attachments: TEZ-2300.1.patch, TEZ-2300.2.patch, TEZ-2300.3.patch, 
> TEZ-2300.4.patch, syslog_dag_1428329756093_325099_1_post 
>
>
>   Noticed this with a couple of Pig scripts that were not behaving well (AM 
> close to OOM, etc.) and even with some that were running fine. Pig calls 
> TezClient.stop() in a shutdown hook. On Ctrl+C, the Pig script either exits 
> immediately or hangs. In both cases it takes a long time for the YARN 
> application to go to the KILLED state. Many times I just end up calling yarn 
> application -kill separately after waiting 5 minutes or more for it to get 
> killed.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
