[ 
https://issues.apache.org/jira/browse/HDFS-10536?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15333878#comment-15333878
 ] 

XingFeng Shen commented on HDFS-10536:
--------------------------------------

Reproduce steps:
1) stop two zkfc
2) restart active nn, then nn will become standby.
3) we can check the standby NN logs which will throw the exception.
4) start two zkfc, nn will become ative nn. Standby NN will also throw 
exception again.


> Standby NN can not trigger log roll after EditLogTailer thread failed 3 times 
> in EditLogTailer.triggerActiveLogRoll method.
> ---------------------------------------------------------------------------------------------------------------------------
>
>                 Key: HDFS-10536
>                 URL: https://issues.apache.org/jira/browse/HDFS-10536
>             Project: Hadoop HDFS
>          Issue Type: Bug
>          Components: auto-failover
>            Reporter: XingFeng Shen
>            Priority: Critical
>         Attachments: HDFS-10536.patch
>
>
> When all NameNodes become standby, EditLogTailer will retry 3 times to 
> trigger log roll, then it will be failed and throw Exception "Cannot find any 
> valid remote NN to service request!". After one namenode become active, 
> standby NN still can not trigger log roll again because variable 
> "nnLoopCount" is still 3, it can not init to 0.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to