NodeManager doesn't recognize kill -9 of AM container
-----------------------------------------------------
Key: MAPREDUCE-3072
URL: https://issues.apache.org/jira/browse/MAPREDUCE-3072
Project: Hadoop Map/Reduce
Issue Type: Bug
Components: nodemanager
Affects Versions: 0.23.0
Environment: [criccomi@criccomi-ld trunk]$ svn info
Path: .
URL: http://svn.apache.org/repos/asf/hadoop/common/trunk
Repository Root: http://svn.apache.org/repos/asf
Repository UUID: 13f79535-47bb-0310-9956-ffa450edef68
Revision: 1174189
Node Kind: directory
Schedule: normal
Last Changed Author: szetszwo
Last Changed Rev: 1173990
Last Changed Date: 2011-09-22 01:25:20 -0700 (Thu, 22 Sep 2011)
Reporter: Chris Riccomini
If I kill -9 my application master's pid, the NM continues reporting that the
container is running. I assume it should probably instead report back to the RM
that the AM has died. Instead, it continues sending this status:
2011-09-22 09:33:13,352 INFO nodemanager.NodeStatusUpdaterImpl
(NodeStatusUpdaterImpl.java:getNodeStatus(222)) - Sending out status for
container: container_id {, app_attempt_id {, application_id {, id: 1,
cluster_timestamp: 1316707951832, }, attemptId: 1, }, id: 1, }, state:
C_RUNNING, diagnostics: "\n", exit_status: -1000,
2011-09-22 09:33:13,682 INFO monitor.ContainersMonitorImpl
(ContainersMonitorImpl.java:run(402)) - Memory usage of ProcessTree 27263 for
container-id container_1316707951832_0001_01_000001 : Virtual 0 bytes, limit :
2147483648 bytes; Physical 0 bytes, limit -1 bytes
This status keeps being sent forever.
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira