Re: DFSck - fsck for hadoop

Eric Baldeschwieler Sat, 25 Mar 2006 21:30:28 -0800

+1 again 8-)

On Mar 23, 2006, at 2:26 PM, Yoram Arnon wrote:

Another idea, in addition to an explicit format command, is to
configure the name node with the cluster's data nodes, rather than
allowing any node to connect ad hoc. A name node would then ignore an
unexpected data node. It would also be able to report when a data node
is missing and could make operational decisions based on the number
and identity of nodes that are up vs. down.
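
As a rough illustration of the configured-membership idea above, here
is a minimal sketch in Python. It is not Hadoop code: the class name,
method names, the host names other than nutch0, and the "minimum live
fraction" rule are all assumptions made up for this example.

```python
from dataclasses import dataclass, field


@dataclass
class NameNodeMembership:
    """Hypothetical registry of the data nodes a name node expects."""

    expected: frozenset                      # configured data node names
    live: set = field(default_factory=set)   # nodes currently connected

    def register(self, node: str) -> bool:
        """Accept a data node only if it is in the configured list."""
        if node not in self.expected:
            return False  # unexpected node: ignore it and its reports
        self.live.add(node)
        return True

    def mark_down(self, node: str) -> None:
        """Record that a previously connected node has gone away."""
        self.live.discard(node)

    def missing(self) -> set:
        """Configured nodes that are not currently up."""
        return set(self.expected) - self.live

    def safe_to_modify(self, min_live_fraction: float = 0.5) -> bool:
        """One possible operational rule (an assumption): only allow
        destructive actions when enough configured nodes are up."""
        if not self.expected:
            return False
        return len(self.live) / len(self.expected) >= min_live_fraction
```

With expected nodes nutch1, nutch2 and nutch3, a connection attempt
from nutch0 would simply be refused, and missing() would list whichever
configured nodes have not yet checked in.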

-----Original Message-----
From: Doug Cutting [mailto:[EMAIL PROTECTED]
Sent: Thursday, March 23, 2006 12:27 PM
To: [email protected]
Subject: Re: DFSck - fsck for hadoop

[EMAIL PROTECTED] wrote:
My error was that I intended to run nutch0 as job.tracker, but not as
a datanode.  So, when I ran bin/start-all.sh to start the cluster, it
seemed to replicate the non-existent filesystem on nutch0; thereby
starting to delete all my precious data.
It would be nice if this were harder to do. A simple solution I
proposed would be to make it so that a new filesystem is not created
automatically when a namenode is started in an empty directory. Rather
a 'format' command could be required. A more complex solution might be
to have a filesystem id.  For example, some bits from each block id
issued could be the filesystem id. When datanodes report blocks from a
different filesystem, the namenode would ignore them rather than
delete them.

Doug
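
The two proposals in the message above (an explicit 'format' step and
a filesystem id carried in each block id) can be sketched roughly as
follows. This is an illustration in Python, not Hadoop code: the 16/48
bit split, the class and function names, and the in-memory state are
all assumptions chosen for the example.

```python
import secrets

FS_ID_BITS = 16   # assumed: top 16 bits of a 64-bit block id
SERIAL_BITS = 48  # assumed: remaining bits number blocks in one filesystem


def make_block_id(fs_id: int, serial: int) -> int:
    """Pack the filesystem id into the high bits of a block id."""
    return (fs_id << SERIAL_BITS) | (serial & ((1 << SERIAL_BITS) - 1))


def fs_id_of(block_id: int) -> int:
    """Recover the filesystem id from a block id."""
    return block_id >> SERIAL_BITS


class NameNode:
    """Hypothetical name node that must be formatted before it starts."""

    def __init__(self):
        self.fs_id = None      # None means no filesystem exists yet
        self.next_serial = 0
        self.known = set()     # block ids this filesystem has issued

    def format(self, fs_id=None):
        """Explicitly create a new, empty filesystem."""
        self.fs_id = secrets.randbits(FS_ID_BITS) if fs_id is None else fs_id
        self.next_serial = 0
        self.known = set()

    def start(self):
        """Refuse to start instead of silently creating a filesystem."""
        if self.fs_id is None:
            raise RuntimeError("no filesystem found; run format first")

    def allocate_block(self) -> int:
        """Issue a new block id tagged with this filesystem's id."""
        block_id = make_block_id(self.fs_id, self.next_serial)
        self.next_serial += 1
        self.known.add(block_id)
        return block_id

    def handle_block_report(self, block_ids):
        """Split reported blocks into (ignored, to_delete).

        Blocks from a different filesystem are ignored rather than
        deleted; only unknown blocks from this filesystem are deleted.
        """
        ignored = [b for b in block_ids if fs_id_of(b) != self.fs_id]
        to_delete = [b for b in block_ids
                     if fs_id_of(b) == self.fs_id and b not in self.known]
        return ignored, to_delete
```

Under this sketch, a datanode holding blocks from an older filesystem
that reports to a freshly formatted namenode would have its blocks
ignored, since their high bits carry a different filesystem id.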
