Zhizhen Hou created MAPREDUCE-7187:
--------------------------------------
Summary:
RMContainerAllocator.ScheduledRequests#getContainerReqToReplace may not find a
task when the priority of container is PRIORITY_MAP
Key: MAPREDUCE-7187
URL: https://issues.apache.org/jira/browse/MAPREDUCE-7187
Project: Hadoop Map/Reduce
Issue Type: Bug
Components: applicationmaster
Affects Versions: 2.7.5
Reporter: Zhizhen Hou
The resource manager may have allocated a map container on a host ("h1", for
example) for an application, and the container has not yet been fetched by the
MRAppMaster. At this point, the MRAppMaster receives a task failure event for a
task that ran on host h1. The event causes h1 to be blacklisted. The MRAppMaster
then sends a heartbeat and receives the container on h1. The MRAppMaster cannot
assign the container because it is on a blacklisted host. #getContainerReqToReplace
then fails to return another task, which may cause a map task to hang forever.
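
The sequence above can be sketched as a toy model. This is not the Hadoop code:
the class, fields and methods below are hypothetical, the replacement lookup is
a plain function passed in by the caller, and the assumption that the resource
manager treats a request as satisfied once it has allocated a container is made
only for illustration. The sketch shows why an empty lookup result leaves a
pending map task with no outstanding container request.

```java
import java.util.ArrayDeque;
import java.util.Deque;
import java.util.HashSet;
import java.util.Optional;
import java.util.Set;
import java.util.function.Function;

// Toy model of the race described above. All names are hypothetical.
public class BlacklistRaceSketch {
    final Set<String> blacklistedHosts = new HashSet<>();
    final Deque<String> pendingMapTasks = new ArrayDeque<>();
    // Container requests the resource manager still has to satisfy (assumed bookkeeping).
    int outstandingAsks = 0;

    void requestMap(String taskId) {
        pendingMapTasks.addLast(taskId);
        outstandingAsks++;
    }

    // The RM allocates a container; from its point of view one request is now satisfied.
    void rmAllocates() {
        outstandingAsks--;
    }

    // A task failure on a host blacklists that host.
    void onTaskFailed(String host) {
        blacklistedHosts.add(host);
    }

    // Returns the task the container was assigned to, or null if the container was unusable.
    String onContainerReceived(String host,
                               Function<String, Optional<String>> findRequestToReplace) {
        if (!blacklistedHosts.contains(host)) {
            return pendingMapTasks.pollFirst();
        }
        // The container is on a blacklisted host. A new request is issued
        // only if the lookup names a pending request to replace.
        if (findRequestToReplace.apply(host).isPresent()) {
            outstandingAsks++;
        }
        return null;
    }

    // A task is pending but nothing is being requested for it.
    boolean isStuck() {
        return !pendingMapTasks.isEmpty() && outstandingAsks == 0;
    }

    // Replays the reported sequence and reports whether the map task is stuck.
    static boolean simulate(boolean lookupSucceeds) {
        BlacklistRaceSketch am = new BlacklistRaceSketch();
        am.requestMap("m2");   // a map task is waiting for a container
        am.rmAllocates();      // RM allocates on h1; the AM has not fetched it yet
        am.onTaskFailed("h1"); // another task fails on h1, so h1 is blacklisted
        am.onContainerReceived("h1",
            host -> lookupSucceeds ? Optional.of("m2") : Optional.<String>empty());
        return am.isStuck();
    }

    public static void main(String[] args) {
        System.out.println("lookup fails    -> stuck: " + simulate(false));
        System.out.println("lookup succeeds -> stuck: " + simulate(true));
    }
}
```

In this model the task is stuck only when the lookup returns nothing, which
matches the symptom reported here.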