[
https://issues.apache.org/jira/browse/YARN-479?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13623979#comment-13623979
]
Bikas Saha commented on YARN-479:
---------------------------------
Patch looks good overall!
why is this line showing as a diff
{code}
- List<ContainerStatus> containersStatuses = new
ArrayList<ContainerStatus>();
...
+ List<ContainerStatus> containersStatuses = new
ArrayList<ContainerStatus>();
{code}
why is this change needed?
{code}
public void testNMRegistration() throws InterruptedException {
+ final long connectionWaitSecs = 5;
+ final long connectionRetryIntervalSecs = 1;
+ YarnConfiguration conf = createNMConfig();
+ conf.setLong(YarnConfiguration.RESOURCEMANAGER_CONNECT_WAIT_SECS,
+ connectionWaitSecs);
+ conf.setLong(YarnConfiguration
+ .RESOURCEMANAGER_CONNECT_RETRY_INTERVAL_SECS,
+ connectionRetryIntervalSecs);
+
nm = new NodeManager() {
{code}
and this change needed?
{code}
@@ -527,7 +599,6 @@ protected NodeStatusUpdater createNodeStatusUpdater(Context
context,
}
};
- YarnConfiguration conf = createNMConfig();
nm.init(conf);
{code}
The message can be made part of the Assert
{code}
+ //calculate heartBeatCount based on connectionWaitSecs and
RetryIntervalSecs
+ Assert.assertTrue(heartBeatCount == 2);
{code}
Can we pass in the barrier etc into the custom derived classes of nodemanager,
rmservice etc so that we can avoid global vars?
> NM retry behavior for connection to RM should be similar for lost heartbeats
> ----------------------------------------------------------------------------
>
> Key: YARN-479
> URL: https://issues.apache.org/jira/browse/YARN-479
> Project: Hadoop YARN
> Issue Type: Bug
> Reporter: Hitesh Shah
> Assignee: Jian He
> Attachments: YARN-479.1.patch, YARN-479.2.patch, YARN-479.3.patch,
> YARN-479.4.patch, YARN-479.5.patch, YARN-479.6.patch, YARN-479.7.patch,
> YARN-479.8.patch
>
>
> Regardless of connection loss at the start or at an intermediate point, NM's
> retry behavior to the RM should follow the same flow.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira