Re: DFSck - fsck for hadoop

Doug Cutting Thu, 23 Mar 2006 12:27:58 -0800

[EMAIL PROTECTED] wrote:

My error was that I intended to run nutch0 as job.tracker, but not as a
datanode.  So, when I ran bin/start-all.sh to start the cluster, it seemed to
replicate the non-existent filesystem on nutch0; thereby starting to delete all
my precious data.

It would be nice if this were harder to do. A simple solution Iproposed would be to make it so that a new filesystem is not createdautomatically when a namenode is started in an empty directory. Rathera 'format' command could be required. A more complex solution might beto have a filesystem id. For example, some bits from each block idissued could be the filesystem id. When datanodes report blocks from adifferent filesystem, the namenode would ignore them rather than deletethem.


Doug

Re: DFSck - fsck for hadoop

Reply via email to