[
https://issues.apache.org/jira/browse/YARN-9809?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17203346#comment-17203346
]
Eric Payne commented on YARN-9809:
----------------------------------
The latest branch-3.2 precommit build looks fine. The unit test failures are
the same ones that are failing on branch-3.2 without the patch _except_
{{TestRaceWhenRelogin}}, which is not failing for me in my local build with or
without the patch.
+1. I will commit this today.
> NMs should supply a health status when registering with RM
> ----------------------------------------------------------
>
> Key: YARN-9809
> URL: https://issues.apache.org/jira/browse/YARN-9809
> Project: Hadoop YARN
> Issue Type: Bug
> Reporter: Eric Badger
> Assignee: Eric Badger
> Priority: Major
> Fix For: 3.4.0
>
> Attachments: YARN-9809-branch-3.2.007.patch,
> YARN-9809-branch-3.2.008.patch, YARN-9809-branch-3.2.009.patch,
> YARN-9809.001.patch, YARN-9809.002.patch, YARN-9809.003.patch,
> YARN-9809.004.patch, YARN-9809.005.patch, YARN-9809.006.patch,
> YARN-9809.007.patch
>
>
> Currently if the NM registers with the RM and it is unhealthy, it can be
> scheduled many containers before the first heartbeat. After the first
> heartbeat, the RM will mark the NM as unhealthy and kill all of the
> containers.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]