[
https://issues.apache.org/jira/browse/YARN-4464?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15385192#comment-15385192
]
Karthik Kambatla commented on YARN-4464:
----------------------------------------
I know there is no right answer here. We should have picked a better default to
begin with.
IAC, my preference would be whatever least astonishes the admins/users. Options
sorted by least astonishment:
# Don't change anything. Keep it at 10,000 and deal with recovery slowness etc.
# Change it to 0. When people try out Hadoop 3 and failover, they immediately
realize they don't see any completed applications. However, they all will
likely have to change it
# Change it to 1000. People will realize it late, but most users might not
necessarily run into any issues ever.
By the way, one other change we should make is to limit
{{rm.store.max-completed-apps}} to {{rm.max-completed-apps}}.
> default value of yarn.resourcemanager.state-store.max-completed-applications
> should lower.
> ------------------------------------------------------------------------------------------
>
> Key: YARN-4464
> URL: https://issues.apache.org/jira/browse/YARN-4464
> Project: Hadoop YARN
> Issue Type: Wish
> Components: resourcemanager
> Reporter: KWON BYUNGCHANG
> Assignee: Daniel Templeton
> Priority: Blocker
> Attachments: YARN-4464.001.patch, YARN-4464.002.patch,
> YARN-4464.003.patch, YARN-4464.004.patch
>
>
> my cluster has 120 nodes.
> I configured RM Restart feature.
> {code}
> yarn.resourcemanager.recovery.enabled=true
> yarn.resourcemanager.store.class=org.apache.hadoop.yarn.server.resourcemanager.recovery.FileSystemRMStateStore
> yarn.resourcemanager.fs.state-store.uri=/system/yarn/rmstore
> {code}
> unfortunately I did not configure
> {{yarn.resourcemanager.state-store.max-completed-applications}}.
> so that property configured default value 10,000.
> I have restarted RM due to changing another configuartion.
> I expected that RM restart immediately.
> recovery process was very slow. I have waited about 20min.
> realize missing
> {{yarn.resourcemanager.state-store.max-completed-applications}}.
> its default value is very huge.
> need to change lower value or document notice on [RM Restart
> page|http://hadoop.apache.org/docs/stable/hadoop-yarn/hadoop-yarn-site/ResourceManagerRestart.html].
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]