Zhe Zhang created HDFS-11709:
--------------------------------
Summary: StandbyCheckpointer should handle an non-existing
legacyOivImageDir gracefully
Key: HDFS-11709
URL: https://issues.apache.org/jira/browse/HDFS-11709
Project: Hadoop HDFS
Issue Type: Bug
Components: ha, namenode
Affects Versions: 2.6.1
Reporter: Zhe Zhang
Assignee: Erik Krogen
Priority: Critical
In {{StandbyCheckpointer}}, if the legacy OIV directory is not properly
created, or was deleted for some reason (e.g. mis-operation), all checkpoint
ops will fall. Not only the ANN won't receive new fsimages, the JNs will get
full with edit log files, and cause NN to crash.
{code}
// Save the legacy OIV image, if the output dir is defined.
String outputDir = checkpointConf.getLegacyOivImageDir();
if (outputDir != null && !outputDir.isEmpty()) {
img.saveLegacyOIVImage(namesystem, outputDir, canceler);
}
{code}
It doesn't make sense to let such an unimportant part (saving OIV) abort all
checkpoints and cause NN crash (and possibly lose data).
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]