A Datanode is declared dead if heartbeats are missing for 10 minutes. The
Datanodes typically send a heartbeat every 3 seconds.

Thanks,
dhruba

-----Original Message-----
From: Hairong Kuang [mailto:[EMAIL PROTECTED] 
Sent: Tuesday, July 17, 2007 12:30 PM
To: [email protected]
Subject: RE: HDFS replica management

>> Which package in the
>> source code could I look at to glean this information ?

>This is in dfs/FSNameSystem.java.

FSNameSystem.java is a huge chunk of source code. To be more specific,
datanode failure detection is done by HeartbeatMonitor. Once a data node is
detected as dead, all blocks belonged to this data node will be put in
neededReplications queue. Then the ReplicationMonitor will start to
replicate those under-replicated blocks. All the replication target chosen
logic is in dfs/ReplicationTargetChooser.java.

Hairong


Reply via email to