Chengbing Liu updated YARN-2997:
    Attachment: YARN-2997.4.patch

Updated patch.

The testing-only method is removed. {{pendingCompletedContainers.clear()}} is 
added at the end of {{removeOrTrackCompletedContainersFromContext}}, and also 
in RESYNC section to clear the cache so that these outdated container statuses 
will not be reported to the restarted RM.

> NM keeps sending finished containers to RM until app is finished
> ----------------------------------------------------------------
>                 Key: YARN-2997
>                 URL: https://issues.apache.org/jira/browse/YARN-2997
>             Project: Hadoop YARN
>          Issue Type: Bug
>          Components: nodemanager
>    Affects Versions: 2.6.0
>            Reporter: Chengbing Liu
>            Assignee: Chengbing Liu
>         Attachments: YARN-2997.2.patch, YARN-2997.3.patch, YARN-2997.4.patch, 
> YARN-2997.patch
> We have seen in RM log a lot of
> {quote}
> org.apache.hadoop.yarn.server.resourcemanager.scheduler.fair.FairScheduler: 
> Null container completed...
> {quote}
> It is caused by NM sending completed containers repeatedly until the app is 
> finished. On the RM side, the container is already released, hence 
> {{getRMContainer}} returns null.

This message was sent by Atlassian JIRA

Reply via email to