[
https://issues.apache.org/jira/browse/YARN-5773?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15618682#comment-15618682
]
Bibin A Chundatt edited comment on YARN-5773 at 10/29/16 8:25 PM:
------------------------------------------------------------------
{noformat}
JDK v1.8.0_111 Failed junit tests
hadoop.yarn.server.resourcemanager.TestClientRMTokens
hadoop.yarn.server.resourcemanager.TestAMAuthorization
JDK v1.7.0_111 Failed junit tests
hadoop.yarn.server.resourcemanager.TestClientRMTokens
hadoop.yarn.server.resourcemanager.TestAMAuthorization
{noformat}
IIUC these case are due to hostname resolution.
was (Author: bibinchundatt):
{format}
JDK v1.8.0_111 Failed junit tests
hadoop.yarn.server.resourcemanager.TestClientRMTokens
hadoop.yarn.server.resourcemanager.TestAMAuthorization
JDK v1.7.0_111 Failed junit tests
hadoop.yarn.server.resourcemanager.TestClientRMTokens
hadoop.yarn.server.resourcemanager.TestAMAuthorization
{format}
IIUC these case are due to hostname resolution.
> RM recovery too slow due to LeafQueue#activateApplication()
> -----------------------------------------------------------
>
> Key: YARN-5773
> URL: https://issues.apache.org/jira/browse/YARN-5773
> Project: Hadoop YARN
> Issue Type: Bug
> Components: capacity scheduler, rolling upgrade
> Reporter: Bibin A Chundatt
> Assignee: Bibin A Chundatt
> Priority: Critical
> Labels: oct16-medium
> Attachments: YARN-5773-branch-2.8.0001.patch, YARN-5773.0001.patch,
> YARN-5773.0002.patch, YARN-5773.0004.patch, YARN-5773.0005.patch,
> YARN-5773.0006.patch, YARN-5773.0007.patch, YARN-5773.0008.patch,
> YARN-5773.0009.patch
>
>
> # Submit application 10K application to default queue.
> # All applications are in accepted state
> # Now restart resourcemanager
> For each application recovery {{LeafQueue#activateApplications()}} is
> invoked.Resulting in AM limit check to be done even before Node managers are
> getting registered.
> Total iteration for N application is about {{N(N+1)/2}} for {{10K}}
> application {{50000000}} iterations causing time take for Rm to be active
> more than 10 min.
> Since NM resources are not yet added to during recovery we should skip
> {{activateApplicaiton()}}
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]