[jira] [Updated] (TEZ-3128) Avoid stopping containers on the AM shutdown thread

2016-02-26 Thread Jason Lowe (JIRA)

 [ 
https://issues.apache.org/jira/browse/TEZ-3128?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jason Lowe updated TEZ-3128:

 Hadoop Flags: Reviewed
Fix Version/s: 0.7.1

bq. should this be pulled into 0.7 as well ?

Yes, I committed this to branch-0.7.  Thanks!


> Avoid stopping containers on the AM shutdown thread
> ---
>
> Key: TEZ-3128
> URL: https://issues.apache.org/jira/browse/TEZ-3128
> Project: Apache Tez
>  Issue Type: Bug
>Affects Versions: 0.8.0-alpha
>Reporter: Siddharth Seth
>Assignee: Tsuyoshi Ozawa
>  Labels: newbie
> Fix For: 0.7.1, 0.8.3
>
> Attachments: TEZ-3128.001.patch, TEZ-3128.002.patch, 
> TEZ-3128.003.patch, TEZ-3128.004.patch, amJstack
>
>
> During an AM shutdown, the TaskCommunicator is also shutdown and it tries to 
> stop containers in the shutdown thread itself. This can cause the AM shutdown 
> to block if NMs are not available.
> This likely affects 0.7 as well.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (TEZ-3128) Avoid stopping containers on the AM shutdown thread

2016-02-26 Thread Tsuyoshi Ozawa (JIRA)

 [ 
https://issues.apache.org/jira/browse/TEZ-3128?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tsuyoshi Ozawa updated TEZ-3128:

Attachment: TEZ-3128.004.patch

Attaching v04 patch to remove a needless comment.

> Avoid stopping containers on the AM shutdown thread
> ---
>
> Key: TEZ-3128
> URL: https://issues.apache.org/jira/browse/TEZ-3128
> Project: Apache Tez
>  Issue Type: Bug
>Affects Versions: 0.8.0-alpha
>Reporter: Siddharth Seth
>Assignee: Tsuyoshi Ozawa
>  Labels: newbie
> Attachments: TEZ-3128.001.patch, TEZ-3128.002.patch, 
> TEZ-3128.003.patch, TEZ-3128.004.patch, amJstack
>
>
> During an AM shutdown, the TaskCommunicator is also shutdown and it tries to 
> stop containers in the shutdown thread itself. This can cause the AM shutdown 
> to block if NMs are not available.
> This likely affects 0.7 as well.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (TEZ-3128) Avoid stopping containers on the AM shutdown thread

2016-02-24 Thread Tsuyoshi Ozawa (JIRA)

 [ 
https://issues.apache.org/jira/browse/TEZ-3128?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tsuyoshi Ozawa updated TEZ-3128:

Attachment: TEZ-3128.002.patch

Sidd and Hitesh, thank for the clarification. 

Created the patch to make the kill faster when shutdown by making the interval 
of NMProxy's retry and the timeout of creating connection smaller. 

> Avoid stopping containers on the AM shutdown thread
> ---
>
> Key: TEZ-3128
> URL: https://issues.apache.org/jira/browse/TEZ-3128
> Project: Apache Tez
>  Issue Type: Bug
>Affects Versions: 0.8.0-alpha
>Reporter: Siddharth Seth
>Assignee: Tsuyoshi Ozawa
>  Labels: newbie
> Attachments: TEZ-3128.001.patch, TEZ-3128.002.patch, amJstack
>
>
> During an AM shutdown, the TaskCommunicator is also shutdown and it tries to 
> stop containers in the shutdown thread itself. This can cause the AM shutdown 
> to block if NMs are not available.
> This likely affects 0.7 as well.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (TEZ-3128) Avoid stopping containers on the AM shutdown thread

2016-02-23 Thread Siddharth Seth (JIRA)

 [ 
https://issues.apache.org/jira/browse/TEZ-3128?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Siddharth Seth updated TEZ-3128:

Attachment: amJstack

Here's the stack trace. TezContainerLauncher invokes shutdownAllContainers - 
which seems to try and kill the containers - one at a time - before shutdown.

Releasing containers is sufficient. [~hitesh] - last I spoke to you about this, 
I got the impression that a stop container was required for some reason. If 
that understanding was incorrect, we should be able to avoid the container 
stop. Do we even need to release containers ? If the app is shutting down, YARN 
should take care of this on it's own - once the application unregisters ? (or 
is that where the problem is - the unregistration happens at the end and hence 
we should release the containers early).
IIRC, even for unregistration, YARN allows an app to unregister early - and 
will not kill the AM for a certain amount of time after that.

> Avoid stopping containers on the AM shutdown thread
> ---
>
> Key: TEZ-3128
> URL: https://issues.apache.org/jira/browse/TEZ-3128
> Project: Apache Tez
>  Issue Type: Bug
>Affects Versions: 0.8.0-alpha
>Reporter: Siddharth Seth
>Assignee: Tsuyoshi Ozawa
>  Labels: newbie
> Attachments: TEZ-3128.001.patch, amJstack
>
>
> During an AM shutdown, the TaskCommunicator is also shutdown and it tries to 
> stop containers in the shutdown thread itself. This can cause the AM shutdown 
> to block if NMs are not available.
> This likely affects 0.7 as well.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (TEZ-3128) Avoid stopping containers on the AM shutdown thread

2016-02-23 Thread Siddharth Seth (JIRA)

 [ 
https://issues.apache.org/jira/browse/TEZ-3128?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Siddharth Seth updated TEZ-3128:

Target Version/s: 0.8.3

> Avoid stopping containers on the AM shutdown thread
> ---
>
> Key: TEZ-3128
> URL: https://issues.apache.org/jira/browse/TEZ-3128
> Project: Apache Tez
>  Issue Type: Bug
>Affects Versions: 0.8.0-alpha
>Reporter: Siddharth Seth
>Assignee: Tsuyoshi Ozawa
>  Labels: newbie
> Attachments: TEZ-3128.001.patch
>
>
> During an AM shutdown, the TaskCommunicator is also shutdown and it tries to 
> stop containers in the shutdown thread itself. This can cause the AM shutdown 
> to block if NMs are not available.
> This likely affects 0.7 as well.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (TEZ-3128) Avoid stopping containers on the AM shutdown thread

2016-02-21 Thread Tsuyoshi Ozawa (JIRA)

 [ 
https://issues.apache.org/jira/browse/TEZ-3128?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tsuyoshi Ozawa updated TEZ-3128:

Attachment: TEZ-3128.001.patch

Attaching a patch to make the YarnTaskSchedulerService's timeout for shutdown 
smaller. 

> Avoid stopping containers on the AM shutdown thread
> ---
>
> Key: TEZ-3128
> URL: https://issues.apache.org/jira/browse/TEZ-3128
> Project: Apache Tez
>  Issue Type: Bug
>Affects Versions: 0.8.0-alpha
>Reporter: Siddharth Seth
>  Labels: newbie
> Attachments: TEZ-3128.001.patch
>
>
> During an AM shutdown, the TaskCommunicator is also shutdown and it tries to 
> stop containers in the shutdown thread itself. This can cause the AM shutdown 
> to block if NMs are not available.
> This likely affects 0.7 as well.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (TEZ-3128) Avoid stopping containers on the AM shutdown thread

2016-02-19 Thread Hitesh Shah (JIRA)

 [ 
https://issues.apache.org/jira/browse/TEZ-3128?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Hitesh Shah updated TEZ-3128:
-
Labels: newbie  (was: )

> Avoid stopping containers on the AM shutdown thread
> ---
>
> Key: TEZ-3128
> URL: https://issues.apache.org/jira/browse/TEZ-3128
> Project: Apache Tez
>  Issue Type: Bug
>Affects Versions: 0.8.0-alpha
>Reporter: Siddharth Seth
>  Labels: newbie
>
> During an AM shutdown, the TaskCommunicator is also shutdown and it tries to 
> stop containers in the shutdown thread itself. This can cause the AM shutdown 
> to block if NMs are not available.
> This likely affects 0.7 as well.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)