Hi All, We had a hardware failure on our namenode that led to corruption in our DFS. I ran an fsck and moved the corrupted files to a lost+found directory. The DFS now seems to run fine by itself. However, if I run HBase following the fsck, I get a bunch of FileNotFound exceptions as it tries to access some of the files that were corrupted. This ultimately seems to lead to the HMaster getting in a bad state where it doesn't respond.
So I'm wondering if there is a way to recover from my current state. If that's not possible, how could I handle things differently to recover from a namenode failure in the future. I see there is an open issue HBASE-7 about an fsck-like feature for HBase. However, it's not clear to me whether this proposal would even address this problem. Any pointers appreciated. We're running Hadoop 0.18.2 and HBase 0.18.1. Thanks!
