[jira] [Updated] (HDFS-9678) Standby NN sometimes does not clear needRollbackFsImage
[ https://issues.apache.org/jira/browse/HDFS-9678?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Steve Loughran updated HDFS-9678:
---------------------------------
    Target Version/s: 2.7.8  (was: 2.7.7)

> Standby NN sometimes does not clear needRollbackFsImage
> -------------------------------------------------------
>
>                 Key: HDFS-9678
>                 URL: https://issues.apache.org/jira/browse/HDFS-9678
>             Project: Hadoop HDFS
>          Issue Type: Bug
>            Reporter: Kihwal Lee
>            Priority: Major
>
> When the edit log loader sees {{OP_ROLLING_UPGRADE_START}}, it calls
> {{setNeedRollbackFsImage(true)}}. On a standby NN, this flag is cleared only
> by the checkpointer thread, and only when it actually creates a rollback image.
>
> On {{OP_ROLLING_UPGRADE_FINALIZE}}, the rolling upgrade is finalized, but
> {{needRollbackFsImage}} is not cleared if a rollback image was never
> created. This results in perpetual checkpointing by the standby NN.
>
> The standby NN thinks it needs to checkpoint because it needs to create
> a rollback image, but since it is no longer in upgrade mode, it creates a
> regular checkpoint, not a rollback image. As a result, the flag is not
> cleared even after the checkpoint is created.
>
> The standby will keep checkpointing back-to-back, and the checkpoints will be
> uploaded to the active constantly. We noticed this because of increased sync
> time on the active.

--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: hdfs-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: hdfs-issues-h...@hadoop.apache.org
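The flag lifecycle described in the issue can be illustrated with a small toy model. This is a sketch only, not Hadoop code: the class `RollbackFlagSketch`, the `StandbyModel` type, the `checkpointerTick` method, and the `clearOnFinalize` toggle are all hypothetical names introduced for illustration. Only the two op names and the `needRollbackFsImage` flag come from the report. The toggle shows one possible way the loop could be broken (clearing the flag on finalize); the issue text itself does not say which fix was adopted.

```java
// Toy model of the standby NameNode behavior described in HDFS-9678.
// Nothing here is Hadoop's real API; it only mirrors the reported flag logic.
final class RollbackFlagSketch {

    // Op names taken from the issue description.
    enum Op { OP_ROLLING_UPGRADE_START, OP_ROLLING_UPGRADE_FINALIZE }

    static final class StandbyModel {
        boolean needRollbackFsImage;       // flag named in the issue
        boolean rollingUpgradeInProgress;  // hypothetical stand-in for "upgrade mode"
        final boolean clearOnFinalize;     // hypothetical fix toggle (assumption)
        int checkpoints;                   // number of checkpoints written so far

        StandbyModel(boolean clearOnFinalize) {
            this.clearOnFinalize = clearOnFinalize;
        }

        // Mimics the edit log loader reacting to rolling-upgrade ops.
        void applyEdit(Op op) {
            switch (op) {
                case OP_ROLLING_UPGRADE_START:
                    rollingUpgradeInProgress = true;
                    needRollbackFsImage = true;
                    break;
                case OP_ROLLING_UPGRADE_FINALIZE:
                    rollingUpgradeInProgress = false;
                    if (clearOnFinalize) {
                        needRollbackFsImage = false;
                    }
                    break;
            }
        }

        // One pass of the checkpointer. Normal period/transaction triggers are
        // ignored; only the rollback-image trigger is modeled.
        boolean checkpointerTick() {
            if (!needRollbackFsImage) {
                return false; // nothing forces a checkpoint
            }
            checkpoints++;
            if (rollingUpgradeInProgress) {
                // A rollback image is written, so the flag is cleared.
                needRollbackFsImage = false;
            }
            // Otherwise a regular checkpoint is written and the flag stays set.
            return true;
        }
    }

    // Replays START then FINALIZE before the checkpointer ever runs,
    // then counts how many checkpoints the next `ticks` passes produce.
    static int simulate(boolean clearOnFinalize, int ticks) {
        StandbyModel nn = new StandbyModel(clearOnFinalize);
        nn.applyEdit(Op.OP_ROLLING_UPGRADE_START);
        nn.applyEdit(Op.OP_ROLLING_UPGRADE_FINALIZE);
        for (int i = 0; i < ticks; i++) {
            nn.checkpointerTick();
        }
        return nn.checkpoints;
    }

    public static void main(String[] args) {
        System.out.println("flag left set on finalize: " + simulate(false, 5) + " checkpoints");
        System.out.println("flag cleared on finalize:  " + simulate(true, 5) + " checkpoints");
    }
}
```

In this model, when the flag is left set after finalize, every checkpointer pass writes a regular checkpoint and never clears the flag, which matches the back-to-back checkpointing reported above. With the hypothetical toggle enabled, no forced checkpoints occur.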
[jira] [Updated] (HDFS-9678) Standby NN sometimes does not clear needRollbackFsImage
[ https://issues.apache.org/jira/browse/HDFS-9678?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Vinod Kumar Vavilapalli updated HDFS-9678:
------------------------------------------
    Target Version/s: 2.7.4  (was: 2.7.3)

2.7.3 is under release process, changing target-version to 2.7.4.

> Standby NN sometimes does not clear needRollbackFsImage
> -------------------------------------------------------
>
>                 Key: HDFS-9678
>                 URL: https://issues.apache.org/jira/browse/HDFS-9678
>             Project: Hadoop HDFS
>          Issue Type: Bug
>            Reporter: Kihwal Lee