[ 
https://issues.apache.org/jira/browse/MAPREDUCE-7187?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Zhizhen Hou updated MAPREDUCE-7187:
-----------------------------------
    Attachment:     (was: MAPREDUCE-7187-1.patch)

> RMContainerAllocator.ScheduledRequests#getContainerReqToReplace may not find 
> a task when the priority of container is PRIORITY_MAP
> ----------------------------------------------------------------------------------------------------------------------------------
>
>                 Key: MAPREDUCE-7187
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-7187
>             Project: Hadoop Map/Reduce
>          Issue Type: Bug
>          Components: applicationmaster
>    Affects Versions: 2.7.5, 3.1.1, 2.9.2
>            Reporter: Zhizhen Hou
>            Priority: Major
>         Attachments: MAPREDUCE-7187.001.patch
>
>
> The resource manager may have allocated a map container on a host ("h1", for 
> example) for an application, and the container has not yet been fetched by the 
> MRAppMaster. At this point, the MRAppMaster receives a task failure event for 
> a task that ran on host h1. The event causes h1 to be blacklisted. The 
> MRAppMaster then sends a heartbeat and receives a container on h1. The 
> MRAppMaster cannot assign the container because it is on a blacklisted host. 
> #getContainerReqToReplace then fails to return another task, which may cause 
> a map task to hang forever.
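
The description above does not spell out why the lookup comes back empty, so the following is only a toy model of one plausible shape of the problem, not the Hadoop implementation. Every name in it (ReplacementSketch, pendingMaps, mapsByHost, findByHostOnly, findWithFallback) is hypothetical. It assumes that pending map requests are indexed by host, that the per-host index can hold a stale entry for an attempt that is no longer pending, and that a host-only lookup gives up when it hits such an entry.

```java
import java.util.ArrayDeque;
import java.util.Deque;
import java.util.HashMap;
import java.util.Iterator;
import java.util.LinkedHashMap;
import java.util.Map;

/** Toy model only; all names are hypothetical and not taken from Hadoop. */
class ReplacementSketch {
    // Pending map requests: attempt id -> description of the request.
    final Map<String, String> pendingMaps = new LinkedHashMap<>();
    // Host -> attempt ids that asked to run on that host.
    final Map<String, Deque<String>> mapsByHost = new HashMap<>();

    void addRequest(String attemptId, String host) {
        pendingMaps.put(attemptId, "request for " + attemptId);
        mapsByHost.computeIfAbsent(host, h -> new ArrayDeque<>()).addLast(attemptId);
    }

    /** Assumption: the attempt stops being pending but its host entry stays behind. */
    void markNoLongerPending(String attemptId) {
        pendingMaps.remove(attemptId);
    }

    /** Host-only lookup: returns null when the host's last entry is stale. */
    String findByHostOnly(String host) {
        Deque<String> ids = mapsByHost.get(host);
        if (ids != null && !ids.isEmpty()) {
            String id = ids.removeLast();
            return pendingMaps.remove(id) != null ? id : null;
        }
        return takeAnyPending();
    }

    /** Skips stale host entries, then falls back to any pending map request. */
    String findWithFallback(String host) {
        Deque<String> ids = mapsByHost.get(host);
        while (ids != null && !ids.isEmpty()) {
            String id = ids.removeLast();
            if (pendingMaps.remove(id) != null) {
                return id;
            }
        }
        return takeAnyPending();
    }

    private String takeAnyPending() {
        Iterator<String> it = pendingMaps.keySet().iterator();
        if (!it.hasNext()) {
            return null;
        }
        String id = it.next();
        it.remove();
        return id;
    }

    public static void main(String[] args) {
        ReplacementSketch s = new ReplacementSketch();
        s.addRequest("m1", "h1");
        s.addRequest("m2", "h2");
        s.markNoLongerPending("m1");
        // A container arrives on blacklisted host h1 and needs another task.
        System.out.println("host-only lookup: " + s.findByHostOnly("h1"));
        System.out.println("still pending: " + s.pendingMaps.keySet());
    }
}
```

In this model the host-only lookup returns null even though m2 is still waiting, which mirrors the reported symptom of a map task that never gets a container. The fallback variant hands the container's slot to m2 instead. Whether the attached patch takes this approach is not stated in this message.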



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

