[
https://issues.apache.org/jira/browse/HDFS-10536?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15339267#comment-15339267
]
Vinayakumar B commented on HDFS-10536:
--------------------------------------
[~xingfengshen], Thanks for the fix,
1. Fix looks good.
2. Please add a MiniDFSCluster test.
Scenario is simple. Have a cluster with 3 namenodes, all in standby. And
verify that periodic editlog trailing happens.
3. Hit the "Submit patch" once the patch is uploaded.
> Standby NN can not trigger log roll after EditLogTailer thread failed 3 times
> in EditLogTailer.triggerActiveLogRoll method.
> ---------------------------------------------------------------------------------------------------------------------------
>
> Key: HDFS-10536
> URL: https://issues.apache.org/jira/browse/HDFS-10536
> Project: Hadoop HDFS
> Issue Type: Bug
> Components: auto-failover
> Reporter: XingFeng Shen
> Priority: Critical
> Attachments: HDFS-10536.patch
>
>
> When all NameNodes become standby, EditLogTailer will retry 3 times to
> trigger log roll, then it will be failed and throw Exception "Cannot find any
> valid remote NN to service request!". After one namenode become active,
> standby NN still can not trigger log roll again because variable
> "nnLoopCount" is still 3, it can not init to 0.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]