[ 
https://issues.apache.org/jira/browse/TEZ-3128?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Siddharth Seth updated TEZ-3128:
--------------------------------
    Attachment: amJstack

Here's the stack trace. TezContainerLauncher invokes shutdownAllContainers, 
which seems to try to kill the containers, one at a time, before shutdown.

Releasing containers is sufficient. [~hitesh] - last I spoke to you about this, 
I got the impression that a stop container was required for some reason. If 
that understanding was incorrect, we should be able to avoid the container 
stop. Do we even need to release containers? If the app is shutting down, YARN 
should take care of this on its own once the application unregisters. Or is 
that where the problem is: the unregistration happens at the end, and hence 
we should release the containers early?
IIRC, even for unregistration, YARN allows an app to unregister early, and 
will not kill the AM for a certain amount of time after that.
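
If it turns out the stop is required after all, one option is to keep it off 
the shutdown thread's critical path by bounding the wait. What follows is only 
a rough sketch of that idea, not the attached patch and not existing Tez code: 
BoundedContainerStop, ContainerStopper, and stopAll are hypothetical names, and 
ContainerStopper stands in for whatever call actually asks an NM to stop a 
container.

```java
import java.util.List;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.atomic.AtomicInteger;

/** Hypothetical sketch; none of these names are Tez or YARN APIs. */
final class BoundedContainerStop {

    /** Stand-in for the call that asks an NM to stop one container. */
    interface ContainerStopper {
        void stop(String containerId) throws Exception;
    }

    /**
     * Issues the stop requests on daemon threads and waits at most timeoutMs
     * in total, so an unreachable NM cannot block the caller indefinitely.
     * Returns the number of stop calls that completed within the timeout.
     */
    static int stopAll(List<String> containerIds, ContainerStopper stopper,
                       int threads, long timeoutMs) {
        AtomicInteger completed = new AtomicInteger();
        ExecutorService pool = Executors.newFixedThreadPool(threads, r -> {
            Thread t = new Thread(r, "container-stop");
            t.setDaemon(true); // stragglers must not keep the JVM alive
            return t;
        });
        for (String id : containerIds) {
            pool.execute(() -> {
                try {
                    stopper.stop(id);
                    completed.incrementAndGet();
                } catch (Exception e) {
                    // Best effort: a failed or interrupted stop is ignored here.
                }
            });
        }
        pool.shutdown(); // accept no new work, let queued stops run
        try {
            if (!pool.awaitTermination(timeoutMs, TimeUnit.MILLISECONDS)) {
                pool.shutdownNow(); // timed out: interrupt whatever is still running
            }
        } catch (InterruptedException e) {
            pool.shutdownNow();
            Thread.currentThread().interrupt(); // preserve the caller's interrupt
        }
        return completed.get();
    }
}
```

The shutdown path would then call something like stopAll(ids, stopper, 4, 2000) 
and carry on regardless of the result. Whether a timeout like this is 
acceptable depends on the answer to the question above about whether the stop 
is needed at all.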

> Avoid stopping containers on the AM shutdown thread
> ---------------------------------------------------
>
>                 Key: TEZ-3128
>                 URL: https://issues.apache.org/jira/browse/TEZ-3128
>             Project: Apache Tez
>          Issue Type: Bug
>    Affects Versions: 0.8.0-alpha
>            Reporter: Siddharth Seth
>            Assignee: Tsuyoshi Ozawa
>              Labels: newbie
>         Attachments: TEZ-3128.001.patch, amJstack
>
>
> During an AM shutdown, the TaskCommunicator is also shut down, and it tries to 
> stop containers in the shutdown thread itself. This can cause the AM shutdown 
> to block if NMs are not available.
> This likely affects 0.7 as well.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
