[ 
https://issues.apache.org/jira/browse/HADOOP-4995?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12662194#action_12662194
 ] 

Brian Bockelman commented on HADOOP-4995:
-----------------------------------------

Re: Konstantin: I still consider the secondary name-node part of the "online 
system".

I want to be able to take a completely offline image - perhaps something we 
pulled off the tape - and make sure that it's at least valid enough that a 
namenode could load it into memory.  It'd be a way that we can do a "light 
audit" of our backup copies.

Currently, the best we can do is "try and pray" (and it's a manual process).

> Offline Namenode fsImage verification
> -------------------------------------
>
>                 Key: HADOOP-4995
>                 URL: https://issues.apache.org/jira/browse/HADOOP-4995
>             Project: Hadoop Core
>          Issue Type: New Feature
>            Reporter: Brian Bockelman
>
> Currently, there is no way to verify that a copy of the fsImage is not 
> corrupt.  I propose that we should have an offline tool that loads the 
> fsImage into memory to see if it is usable.  This will allow us to automate 
> backup testing to some extent.
> One can start a namenode process on the fsImage to see if it can be loaded, 
> but this is not easy to automate.
> To use HDFS in production, it is greatly desired to have both checkpoints - 
> and have some idea that the checkpoints are valid!  No one wants to see the 
> day where they reload from backup only to find that the fsImage in the backup 
> wasn't usable.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to