Zhe Zhang created HDFS-11709:
--------------------------------

             Summary: StandbyCheckpointer should handle an non-existing legacyOivImageDir gracefully
                 Key: HDFS-11709
                 URL: https://issues.apache.org/jira/browse/HDFS-11709
             Project: Hadoop HDFS
          Issue Type: Bug
          Components: ha, namenode
    Affects Versions: 2.6.1
            Reporter: Zhe Zhang
            Assignee: Erik Krogen
            Priority: Critical

In {{StandbyCheckpointer}}, if the legacy OIV directory was not properly created, or was deleted for some reason (e.g. by operator error), all checkpoint operations will fail. Not only will the ANN stop receiving new fsimages, but the JNs will also fill up with edit log files and cause the NN to crash.

{code}
    // Save the legacy OIV image, if the output dir is defined.
    String outputDir = checkpointConf.getLegacyOivImageDir();
    if (outputDir != null && !outputDir.isEmpty()) {
      img.saveLegacyOIVImage(namesystem, outputDir, canceler);
    }
{code}

It doesn't make sense to let such an unimportant step (saving the legacy OIV image) abort all checkpoints and cause an NN crash (and possibly data loss).
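
One possible shape for the graceful handling is sketched below. This is not the actual patch: it is a self-contained illustration that assumes the failure surfaces as an {{IOException}}. The class {{LegacyOivSketch}} and the methods {{saveLegacyOivImage}} and {{doCheckpoint}} are hypothetical stand-ins, not Hadoop APIs. The idea is only that a failed legacy OIV save is logged and skipped, so the rest of the checkpoint can still run.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;

// Hypothetical, self-contained sketch; none of these names are Hadoop APIs.
class LegacyOivSketch {

    // Stand-in for the legacy OIV save: fails when the directory is missing,
    // which is the situation described in this issue.
    static void saveLegacyOivImage(String outputDir) throws IOException {
        Path dir = Paths.get(outputDir);
        if (!Files.isDirectory(dir)) {
            throw new IOException("Legacy OIV directory does not exist: " + dir);
        }
        // A real implementation would write the image here.
    }

    // Stand-in for the checkpoint step. Returns true only if the legacy OIV
    // image was saved; it never lets the OIV failure propagate.
    static boolean doCheckpoint(String outputDir) {
        boolean legacyOivSaved = false;
        if (outputDir != null && !outputDir.isEmpty()) {
            try {
                saveLegacyOivImage(outputDir);
                legacyOivSaved = true;
            } catch (IOException ioe) {
                // Log and continue: the legacy OIV image is optional output.
                System.err.println("WARN: skipping legacy OIV image: "
                    + ioe.getMessage());
            }
        }
        // The remaining checkpoint work (e.g. making the new fsimage
        // available to the active NN) would continue here regardless.
        return legacyOivSaved;
    }

    public static void main(String[] args) {
        // A missing directory is tolerated instead of aborting the checkpoint.
        System.out.println(doCheckpoint("no-such-dir-hdfs-11709/oiv"));
        // An existing directory takes the normal path.
        System.out.println(doCheckpoint(System.getProperty("java.io.tmpdir")));
    }
}
```

Whether to only log the failure, or also to try re-creating the directory, is a design choice this sketch leaves open.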