[
https://issues.apache.org/jira/browse/MAPREDUCE-3339?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Vinod Kumar Vavilapalli updated MAPREDUCE-3339:
-----------------------------------------------
Attachment: MAPREDUCE-3339-20111220.txt
Attaching patch with trivial update to the configuration names, changed to
{{job.node-blacklisting.enable}} and
{{job.node-blacklisting.ignore-threshold-node-percent}}.
> Job is getting hanged indefinitely,if the child processes are killed on the
> NM. KILL_CONTAINER eventtype is continuosly sent to the containers that are
> not existing
> ---------------------------------------------------------------------------------------------------------------------------------------------------------------------
>
> Key: MAPREDUCE-3339
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-3339
> Project: Hadoop Map/Reduce
> Issue Type: Bug
> Components: mrv2
> Affects Versions: 0.23.0
> Reporter: Ramgopal N
> Assignee: Siddharth Seth
> Priority: Blocker
> Fix For: 0.23.1
>
> Attachments: MAPREDUCE-3339-20111220.txt, MR3339_v1.txt, MR3339_v2.txt
>
>
> I have only one NM running.
> I have submitted a job and all the child processes on the NM got killed
> continuosly.This made the Job to hang indefinitely.
> In the NM logs it is logging WARN message
> :org.apache.hadoop.yarn.server.nodemanager.containermanager.ContainerManagerImpl:
> Event EventType: KILL_CONTAINER sent to absent container
> container_1320301910500_0004_01_001359
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators:
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira