Hi,

For some reason my secondary namenode process died 10 days ago, and that has left me with both an edits and an edits.new file in my dfs/name/current directory. The fsimage file is also there, but it is old and does not contain the merged changes from either edits or edits.new. The cluster had been running fine since the last startup, which was two weeks ago.

Today I restarted the cluster, and now the namenode fails with a NullPointerException. The last saved checkpoint is the same size as the fsimage in the current directory, so replacing it will not help.

This is a test cluster, so the worst case is that I lose the changes that were never merged into the fsimage. I can remove edits.new and bring the cluster up with a clean edits file. I then have to force the namenode out of safe mode, after which fsck reports that HDFS is corrupt, with missing blocks/files, etc.
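For reference, the lossy fallback I just described looks roughly like this. This is only a sketch, not a tested procedure: the dfs.name.dir path is whatever your configuration says, and I am assuming it is safest to back everything up and move edits.new aside rather than delete it. The hadoop commands are guarded so the script does not blow up on a box without the hadoop binary.

```shell
# Sketch of the lossy fallback: back up metadata, discard the un-merged
# edits.new, then (after restarting the namenode) leave safe mode and
# assess the damage with fsck.
recover_namenode() {
    name_dir="$1"   # the namenode's dfs.name.dir, e.g. /var/hadoop/dfs/name

    # 1. Back up the whole name directory before touching anything.
    tar -czf "/tmp/name-backup-$$.tar.gz" \
        -C "$(dirname "$name_dir")" "$(basename "$name_dir")" || return 1

    # 2. Move the stale edits.new aside instead of deleting it outright,
    #    so it can still be inspected or restored later.
    mv "$name_dir/current/edits.new" "$name_dir/current/edits.new.bak"

    # 3. After restarting the namenode, force it out of safe mode and
    #    see what fsck reports (commands exist in Hadoop 0.20).
    if command -v hadoop >/dev/null 2>&1; then
        hadoop dfsadmin -safemode leave
        hadoop fsck /
    fi
}
```

Usage would be something like `recover_namenode /var/hadoop/dfs/name` with the namenode stopped, then start the namenode and let the last two commands run.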

The question I have is whether there is any way to salvage such a situation. I have read that one can perhaps tamper with the edits and edits.new files to bring up the namenode with minimal data loss. Would this require editing these files in a hex editor?

Is there any documentation or an example of how to do this, or is it perhaps not possible and not worth the effort? Either way, it would be good to know whether there is a way out of such a situation.

I have a 3-node test cluster running Hadoop 0.20.2+737.

I would appreciate any help or pointers.

Thanks,
Usman

--
Using Opera's revolutionary email client: http://www.opera.com/mail/
