[ https://issues.apache.org/jira/browse/MAPREDUCE-7187?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16898372#comment-16898372 ]

Wei-Chiu Chuang commented on MAPREDUCE-7187:
--------------------------------------------

[~jiwq] can you help review the patch?

> RMContainerAllocator.ScheduledRequests#getContainerReqToReplace may not find 
> a task when the priority of container is PRIORITY_MAP
> ----------------------------------------------------------------------------------------------------------------------------------
>
>                 Key: MAPREDUCE-7187
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-7187
>             Project: Hadoop Map/Reduce
>          Issue Type: Bug
>          Components: applicationmaster
>    Affects Versions: 2.7.5, 3.1.1, 2.9.2
>            Reporter: Zhizhen Hou
>            Priority: Major
>         Attachments: MAPREDUCE-7187.001.patch
>
>
> The ResourceManager may have allocated a map container on a host ("h1", for 
> example) for an application, and the MRAppMaster has not yet fetched that 
> container. At this point, the MRAppMaster receives a task-failure event for a 
> task on host h1, which causes h1 to be blacklisted. On its next heartbeat, the 
> MRAppMaster receives the container on h1 but cannot assign it because the 
> host is blacklisted. #getContainerReqToReplace then fails to return another 
> task, which may cause a map task to hang forever.
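
To make the scenario above concrete, here is a minimal, self-contained Java sketch. It is not the Hadoop source and not the attached patch: the class ReplaceLookupSketch, its methods, and the task ids are all hypothetical. The issue text does not say why the lookup comes back empty, so the sketch assumes one plausible cause: the per-host index still holds a task that is no longer pending, and the lookup gives up after checking only that entry.

```java
import java.util.ArrayDeque;
import java.util.Deque;
import java.util.HashMap;
import java.util.Iterator;
import java.util.LinkedHashSet;
import java.util.Map;
import java.util.Set;

// Hypothetical, simplified model of the allocator state; NOT the Hadoop code.
class ReplaceLookupSketch {
    // Map tasks that still need a container.
    private final Set<String> pendingMaps = new LinkedHashSet<>();
    // Index from preferred host to the tasks that asked for it.
    private final Map<String, Deque<String>> tasksByHost = new HashMap<>();

    void request(String taskId, String host) {
        pendingMaps.add(taskId);
        tasksByHost.computeIfAbsent(host, h -> new ArrayDeque<>()).addLast(taskId);
    }

    // Assumption: the task is satisfied elsewhere and only the pending set is
    // updated, so the per-host index keeps a stale entry.
    void assignElsewhere(String taskId) {
        pendingMaps.remove(taskId);
    }

    // Checks only the last entry for the host; a stale entry yields null
    // even though other map tasks are still pending.
    String replaceStrict(String host) {
        Deque<String> list = tasksByHost.get(host);
        if (list != null && !list.isEmpty()) {
            String taskId = list.removeLast();
            return pendingMaps.remove(taskId) ? taskId : null;
        }
        return takeAnyPending();
    }

    // Skips stale entries, then falls back to any pending map task.
    String replaceWithFallback(String host) {
        Deque<String> list = tasksByHost.get(host);
        while (list != null && !list.isEmpty()) {
            String taskId = list.removeLast();
            if (pendingMaps.remove(taskId)) {
                return taskId;
            }
        }
        return takeAnyPending();
    }

    private String takeAnyPending() {
        Iterator<String> it = pendingMaps.iterator();
        if (!it.hasNext()) {
            return null;
        }
        String taskId = it.next();
        it.remove();
        return taskId;
    }

    // Two map tasks prefer h1; map_1 is then satisfied elsewhere.
    static ReplaceLookupSketch scenario() {
        ReplaceLookupSketch s = new ReplaceLookupSketch();
        s.request("map_0", "h1");
        s.request("map_1", "h1");
        s.assignElsewhere("map_1");
        return s;
    }

    public static void main(String[] args) {
        // A container arrives on blacklisted h1, so a request must be replaced.
        System.out.println("strict:   " + scenario().replaceStrict("h1"));
        System.out.println("fallback: " + scenario().replaceWithFallback("h1"));
    }
}
```

In this model the strict lookup returns null while map_0 is still waiting, which is the kind of miss that could leave a map task without a container; the fallback variant finds map_0. Whether the real fix takes this shape is for the patch review to decide.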



