[
https://issues.apache.org/jira/browse/HDFS-14201?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16865755#comment-16865755
]
Íñigo Goiri commented on HDFS-14201:
------------------------------------
Thanks [~hexiaoqiao] for the update.
A couple nits:
* Can we use LambdaTestUtils#intercept in
{{testNNHealthCheckWithSafemodeAsUnhealthy()}}?
* Should we fail if we pass the transition in
{{testTransitionToActiveWhenSafeMode()}}? Maybe also use LambdaTestUtils.
* In hdfs-default.xml, let's explain the purpose with something like: "This
will prevent safe mode namenodes to become active while other standby namenodes
might be ready to serve requests.".
* Should we extend the documentation in the NN HA section?
> Ability to disallow safemode NN to become active
> ------------------------------------------------
>
> Key: HDFS-14201
> URL: https://issues.apache.org/jira/browse/HDFS-14201
> Project: Hadoop HDFS
> Issue Type: Improvement
> Components: auto-failover
> Affects Versions: 3.1.1, 2.9.2
> Reporter: Xiao Liang
> Assignee: Xiao Liang
> Priority: Major
> Attachments: HDFS-14201.001.patch, HDFS-14201.002.patch,
> HDFS-14201.003.patch, HDFS-14201.004.patch, HDFS-14201.005.patch,
> HDFS-14201.006.patch, HDFS-14201.007.patch
>
>
> Currently with HA, Namenode in safemode can be possibly selected as active,
> for availability of both read and write, Namenodes not in safemode are better
> choices to become active though.
> It can take tens of minutes for a cold started Namenode to get out of
> safemode, especially when there are large number of files and blocks in HDFS,
> that means if a Namenode in safemode become active, the cluster will be not
> fully functioning for quite a while, even if it can while there is some
> Namenode not in safemode.
> The proposal here is to add an option, to allow Namenode to report itself as
> UNHEALTHY to ZKFC, if it's in safemode, so as to only allow fully functioning
> Namenode to become active, improving the general availability of the cluster.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]