[
https://issues.apache.org/jira/browse/MAPREDUCE-7187?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16785115#comment-16785115
]
Zhizhen Hou commented on MAPREDUCE-7187:
----------------------------------------
This is the first time to submit patch. I need some help.
> RMContainerAllocator.ScheduledRequests#getContainerReqToReplace may not find
> a task when the priority of container is PRIORITY_MAP
> ----------------------------------------------------------------------------------------------------------------------------------
>
> Key: MAPREDUCE-7187
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-7187
> Project: Hadoop Map/Reduce
> Issue Type: Bug
> Components: applicationmaster
> Affects Versions: 2.7.5, 3.1.1, 2.9.2
> Reporter: Zhizhen Hou
> Priority: Major
> Attachments: MAPREDUCE-7187-1.patch, MAPREDUCE-7187.001.patch
>
>
> The resource manager may has allocated a map container on a host ("h1" for
> example) for a application, and the container has not been fetched by the
> MRAppMaster. At this time, the MRAppMaster receives a task fail event, and
> the task is on host h1. The event cause the h1 blacklisted. Now the
> MRAppMaster send a heartbeat, and receive a container on h1. The MRAppMaster
> can not assign the container since it is on a blacklisted host. The
> #getContainerReqToReplace fails returning another task, may cause a map task
> hang forever.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]