[ 
https://issues.apache.org/jira/browse/YARN-8206?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16461556#comment-16461556
 ] 

Eric Badger edited comment on YARN-8206 at 5/2/18 7:57 PM:
-----------------------------------------------------------

Hey [[email protected]], thanks for the review. I fixed up the first 3 
comments that you mentioned in patch 003. I'm wondering, however, for 4, is 
that a regression? Should that be handled in a separate JIRA? I'm looking 
through the code and I thought that error message had always been a problem. 
I'm not super opposed to putting it in this JIRA, just want to make sure I'm on 
the same page


was (Author: ebadger):
Hey [[email protected]], thanks for the review. I fixed up the first 3 
comments that you mentioned. I'm wondering, however, for 4, is that a 
regression? Should that be handled in a separate JIRA? I'm looking through the 
code and I thought that error message had always been a problem. I'm not super 
opposed to putting it in this JIRA, just want to make sure I'm on the same page

> Sending a kill does not immediately kill docker containers
> ----------------------------------------------------------
>
>                 Key: YARN-8206
>                 URL: https://issues.apache.org/jira/browse/YARN-8206
>             Project: Hadoop YARN
>          Issue Type: Sub-task
>            Reporter: Eric Badger
>            Assignee: Eric Badger
>            Priority: Major
>         Attachments: YARN-8206.001.patch, YARN-8206.002.patch, 
> YARN-8206.003.patch
>
>
> {noformat}
>         if (ContainerExecutor.Signal.KILL.equals(signal)
>             || ContainerExecutor.Signal.TERM.equals(signal)) {
>           handleContainerStop(containerId, env);
> {noformat}
> Currently in the code, we are handling both SIGKILL and SIGTERM as equivalent 
> for docker containers. However, they should actually be separate. When YARN 
> sends a SIGKILL to a process, it means for it to die immediately and not sit 
> around waiting for anything. This ensures an immediate reclamation of 
> resources. Additionally, if a SIGTERM is sent before the SIGKILL, the task 
> might not handle the signal correctly, and will then end up as a failed task 
> instead of a killed task. This is especially bad for preemption. 



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to