[
https://issues.apache.org/jira/browse/YARN-3518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14527100#comment-14527100
]
Vinod Kumar Vavilapalli commented on YARN-3518:
-----------------------------------------------
We need to be careful here. Clients from gateway machines should be treated
separately from AMs - a distinction we don't have today. It actually makes
sense for clients to retry for a longer time than is usual for AMs.
> default rm/am expire interval should not less than default resourcemanager
> connect wait time
> --------------------------------------------------------------------------------------------
>
> Key: YARN-3518
> URL: https://issues.apache.org/jira/browse/YARN-3518
> Project: Hadoop YARN
> Issue Type: Bug
> Components: nodemanager, resourcemanager
> Reporter: sandflee
> Assignee: sandflee
> Labels: configuration, newbie
> Attachments: YARN-3518.001.patch
>
>
> Take the AM as an example: if the AM can't connect to the RM, then after the
> AM expires (600s) the RM relaunches it, and two AMs will run at the same time
> until the resourcemanager connect max wait time (900s) has passed.
> DEFAULT_RESOURCEMANAGER_CONNECT_MAX_WAIT_MS = 15 * 60 * 1000;
> DEFAULT_RM_AM_EXPIRY_INTERVAL_MS = 600000;
> DEFAULT_RM_NM_EXPIRY_INTERVAL_MS = 600000;
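
The arithmetic behind the report can be sketched as follows. This is only an
illustration: the class name ExpiryOverlap and the helper overlapMs are
hypothetical and not part of Hadoop; only the three constant values are taken
from the description above. It assumes the worst case, where the old AM keeps
retrying for the full connect max wait and the RM relaunches it right at expiry.

```java
// Hypothetical sketch, not Hadoop code: only the constant values come from the issue.
public class ExpiryOverlap {
    static final long DEFAULT_RESOURCEMANAGER_CONNECT_MAX_WAIT_MS = 15 * 60 * 1000;
    static final long DEFAULT_RM_AM_EXPIRY_INTERVAL_MS = 600000;
    static final long DEFAULT_RM_NM_EXPIRY_INTERVAL_MS = 600000;

    // Upper bound on how long a disconnected process may keep retrying
    // after the RM has already declared it expired. Zero means no overlap.
    static long overlapMs(long connectMaxWaitMs, long expiryIntervalMs) {
        return Math.max(0L, connectMaxWaitMs - expiryIntervalMs);
    }

    public static void main(String[] args) {
        long amOverlap = overlapMs(DEFAULT_RESOURCEMANAGER_CONNECT_MAX_WAIT_MS,
                                   DEFAULT_RM_AM_EXPIRY_INTERVAL_MS);
        // With the defaults above: 900000 - 600000 = 300000 ms
        System.out.println("AM overlap window: " + amOverlap / 1000 + "s");
    }
}
```

With the listed defaults this gives a window of up to 300s in which the old and
the relaunched AM could coexist; the window is zero whenever the expiry
interval is at least as large as the connect max wait.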