[ https://issues.apache.org/jira/browse/HDFS-11709?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15987057#comment-15987057 ]
Erik Krogen commented on HDFS-11709:
------------------------------------
Thanks [~kihwal] and [~zhz]!
> StandbyCheckpointer should handle a non-existing legacyOivImageDir gracefully
> -----------------------------------------------------------------------------
>
> Key: HDFS-11709
> URL: https://issues.apache.org/jira/browse/HDFS-11709
> Project: Hadoop HDFS
> Issue Type: Bug
> Components: ha, namenode
> Affects Versions: 2.6.1
> Reporter: Zhe Zhang
> Assignee: Erik Krogen
> Priority: Critical
> Fix For: 2.9.0, 2.7.4, 3.0.0-alpha3, 2.8.1
>
> Attachments: HDFS-11709.000.patch
>
>
> In {{StandbyCheckpointer}}, if the legacy OIV directory was not properly
> created, or was deleted for some reason (e.g. by mistake), all checkpoint
> operations will fail. Not only will the ANN stop receiving new fsimages, but
> the JNs will also fill up with edit log files and cause the NN to crash.
> {code}
> // Save the legacy OIV image, if the output dir is defined.
> String outputDir = checkpointConf.getLegacyOivImageDir();
> if (outputDir != null && !outputDir.isEmpty()) {
>   img.saveLegacyOIVImage(namesystem, outputDir, canceler);
> }
> {code}
> It doesn't make sense to let such an unimportant step (saving the legacy OIV
> image) abort all checkpoints, crash the NN, and possibly lose data.
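
One way the graceful handling described above could look is to catch the failure of the legacy OIV save, log a warning, and let the rest of the checkpoint proceed. The following is only a minimal, self-contained sketch under that assumption, not the attached patch: the class `CheckpointSketch` and the methods `saveLegacyOivImage` and `doCheckpoint` are hypothetical stand-ins for the real `StandbyCheckpointer` and `FSImage` code.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.logging.Level;
import java.util.logging.Logger;

public class CheckpointSketch {
  private static final Logger LOG =
      Logger.getLogger(CheckpointSketch.class.getName());

  // Hypothetical stand-in for img.saveLegacyOIVImage(...):
  // fails with an IOException when the output directory is missing.
  public static void saveLegacyOivImage(String outputDir) throws IOException {
    Path dir = Paths.get(outputDir);
    if (!Files.isDirectory(dir)) {
      throw new IOException("Legacy OIV image dir does not exist: " + dir);
    }
    // A real implementation would write the legacy image here.
  }

  // Hypothetical checkpoint routine. Returns true when the checkpoint
  // completes, even if the optional legacy OIV save failed.
  public static boolean doCheckpoint(String outputDir) {
    // ... save the regular fsimage (omitted in this sketch) ...

    // Save the legacy OIV image, if the output dir is defined.
    if (outputDir != null && !outputDir.isEmpty()) {
      try {
        saveLegacyOivImage(outputDir);
      } catch (IOException e) {
        // Assumption: the legacy image is optional, so warn and continue
        // instead of aborting the whole checkpoint.
        LOG.log(Level.WARNING,
            "Failed to save legacy OIV image; continuing checkpoint", e);
      }
    }

    // ... upload the new fsimage to the active NameNode (omitted) ...
    return true;
  }

  public static void main(String[] args) {
    // A missing directory only produces a warning; the checkpoint completes.
    System.out.println(doCheckpoint("/nonexistent/legacy-oiv-dir"));
  }
}
```

The design choice in this sketch is to treat the legacy OIV image as best-effort output: a failure there is surfaced in the log but is not allowed to block fsimage upload or edit log purging.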