[
https://issues.apache.org/jira/browse/HDFS-9787?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15140396#comment-15140396
]
Guocui Mi commented on HDFS-9787:
---------------------------------
Agree with you on this, should create one new workitem for this.
1> Non-primary SNNs check ANN checkpoint version via RPC;
2> Check if checkpoint period beyond threshold
Yes : try to upload checkpoint to ANN. become primary SNN if success.
3> repeat 1> 2>.
This patch is just a simple bug fix to make it work as expected first.
> SNNs stop uploading FSImage to ANN once isPrimaryCheckPointer changed to
> false.
> -------------------------------------------------------------------------------
>
> Key: HDFS-9787
> URL: https://issues.apache.org/jira/browse/HDFS-9787
> Project: Hadoop HDFS
> Issue Type: Bug
> Components: ha
> Affects Versions: 3.0.0
> Reporter: Guocui Mi
> Assignee: Guocui Mi
> Attachments: HDFS-9786-v000.patch
>
>
> SNNs stop uploading FSImage to ANN once isPrimaryCheckPointer become false.
> Here is the logic to check if upload FSImage or not.
> In StandbyCheckpointer.java
> boolean sendRequest = isPrimaryCheckPointer || secsSinceLast >=
> checkpointConf.getQuietPeriod();
> doCheckpoint(sendRequest);
> The sendRequest is always false if isPrimaryCheckPointer is false giving
> secsSinceLast (~checkpointPeriod) >= checkpointConf.getQuietPeriod()
> (checkpointPeriod * this.quietMultiplier(default value 1.5)) always returns
> false.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)