Hi All,

We had a hardware failure on our namenode that led to corruption in our
DFS.  I ran an fsck and moved the corrupted files to a lost+found
directory.  The DFS now seems to run fine by itself.  However, if I run
HBase following the fsck, I get a bunch of FileNotFound exceptions as it
tries to access some of the files that were corrupted.  This ultimately
seems to lead to the HMaster getting in a bad state where it doesn't
respond.

So I'm wondering if there is a way to recover from my current state.  If
that's not possible, how could I handle things differently to recover from a
namenode failure in the future.  I see there is an open issue HBASE-7 about
an fsck-like feature for HBase.  However, it's not clear to me whether this
proposal would even address this problem.  Any pointers appreciated.

We're running Hadoop 0.18.2 and HBase 0.18.1.

Thanks!

Reply via email to