[ 
https://issues.apache.org/jira/browse/YARN-445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13631761#comment-13631761
 ] 

Jason Lowe commented on YARN-445:
---------------------------------

Yes, it's an enhancement request to the NM API.  I filed it as signaling 
containers to generalize the jstack-on-task-timeout feature, at least in the 
UNIX sense.  I'm not familiar with the Windows APIs, so I'm not sure how (or 
if) signals map on that platform.  I could see going three different ways on 
this for the NM API:

# methods to trigger various features specific to JVMs like jstack, jmap, etc.
# methods to send generalized signals (if there is a reasonable facsimile on 
Windows)
# give up trying to generalize the concept and add a flag to the 
StopContainerRequest instead

I'd prefer the generalized signal approach if we can come up with a reasonable 
mapping for Windows, as this could be useful for non-JVM containers.  In any 
case, we've had a lot of requests for the ability to trigger jstacks on 
containers in various situations, so I'd like to see at least something done in 
the NM API to achieve this.
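
To make the generalized signal option a bit more concrete, here is a rough 
sketch of how platform-neutral signal names might map onto UNIX signals.  
Everything in it is hypothetical: ContainerSignal, SignalCommands, and their 
methods are names made up for illustration and are not part of the NM API.  
The Windows case simply returns "no mapping", since whether a reasonable 
equivalent exists there is still an open question.

```java
import java.util.Arrays;
import java.util.List;
import java.util.Optional;

// Hypothetical, platform-neutral signal names; not part of any existing YARN API.
enum ContainerSignal {
    THREAD_DUMP("QUIT"),   // SIGQUIT: a HotSpot JVM prints a thread dump and keeps running
    USER_DEFINED("USR1"),  // SIGUSR1: meaning is up to the container's process
    TERMINATE("TERM");     // SIGTERM: polite request to exit

    private final String unixName;

    ContainerSignal(String unixName) {
        this.unixName = unixName;
    }

    String unixName() {
        return unixName;
    }
}

final class SignalCommands {
    private SignalCommands() {}

    // Builds the argv for delivering the signal on UNIX via the POSIX kill utility.
    static List<String> unixCommand(ContainerSignal signal, long pid) {
        return Arrays.asList("kill", "-s", signal.unixName(), Long.toString(pid));
    }

    // Returns empty on Windows, where no mapping has been settled on (assumption).
    static Optional<List<String>> commandFor(ContainerSignal signal, long pid, boolean isWindows) {
        if (isWindows) {
            return Optional.empty();
        }
        return Optional.of(unixCommand(signal, pid));
    }
}
```

With something like this, an AM asking for THREAD_DUMP on a UNIX node would 
get a jstack-style dump from a JVM container without killing it, while a 
non-JVM container could interpret the same signal however it likes.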
                
> Ability to signal containers
> ----------------------------
>
>                 Key: YARN-445
>                 URL: https://issues.apache.org/jira/browse/YARN-445
>             Project: Hadoop YARN
>          Issue Type: Sub-task
>          Components: nodemanager
>    Affects Versions: 2.0.5-beta
>            Reporter: Jason Lowe
>
> It would be nice if an ApplicationMaster could send signals to containers 
> such as SIGQUIT, SIGUSR1, etc.
> For example, in order to replicate the jstack-on-task-timeout feature 
> implemented by MAPREDUCE-1119 in Hadoop 0.21 the NodeManager needs an 
> interface for sending SIGQUIT to a container.  For that specific feature we 
> could implement it as an additional field in the StopContainerRequest.  
> However that would not address other potential features like the ability for 
> an AM to trigger jstacks on arbitrary tasks *without* killing them.  The 
> latter feature would be a very useful debugging tool for users who do not 
> have shell access to the nodes.

