[ 
https://issues.apache.org/jira/browse/HDFS-1496?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12931589#action_12931589
 ] 

Hairong Kuang commented on HDFS-1496:
-------------------------------------

This turns out to be a bug in storage directory restoration. Image validation 
exposes the error.

Currently NN uses rollFSEdits to trigger storage directory recovery. The 
recovery may trigger a saving of the namespace to the newly restored directory 
which as a result changes in memory image digest. However later on image & 
edits were fetched from an old storage directory, thus causing the checksum 
mismatch.

The problem with this storage restoration scheme is that it makes the on-disk 
state of all storage directories inconsistent.

> TestStorageRestore is failing after HDFS-903 fix
> ------------------------------------------------
>
>                 Key: HDFS-1496
>                 URL: https://issues.apache.org/jira/browse/HDFS-1496
>             Project: Hadoop HDFS
>          Issue Type: Bug
>          Components: test
>            Reporter: Konstantin Boudnik
>            Assignee: Hairong Kuang
>
> TestStorageRestore seems to be failing after HDFS-903 commit. Running git 
> bisect confirms it.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to