Ah, yes, very interesting. I get a "Read timed out" on 10.100.11.31, followed by a bunch of "Served block" messages... then finally a Transmitted block once HDFS was restarted.

hadoop-rapleaf-datanode-tf4.rapleaf.com.log.2008-01-02:2008-01-02 17:08:17,057 INFO org.apache.hadoop.dfs.DataNode: Received block blk_4522585614366970680 from /10.100.11.31 and Read timed out hadoop-rapleaf-datanode-tf4.rapleaf.com.log.2008-01-02:2008-01-02 17:13:17,281 INFO org.apache.hadoop.dfs.DataNode: Served block blk_4522585614366970680 to /10.100.11.31
------ a bunch of the above message
hadoop-rapleaf-datanode-tf4.rapleaf.com.log.2008-01-02:2008-01-02 18:53:18,737 INFO org.apache.hadoop.dfs.DataNode: Served block blk_4522585614366970680 to /10.100.11.31
------ HDFS restarted
hadoop-rapleaf-datanode-tf4.rapleaf.com.log.2008-01-03:2008-01-03 16:17:59,637 INFO org.apache.hadoop.dfs.DataNode: Starting thread to transfer block blk_4522585614366970680 to [Lorg.apache.hadoop.dfs.DatanodeInfo;@7e9ffe3f hadoop-rapleaf-datanode-tf4.rapleaf.com.log.2008-01-03:2008-01-03 16:17:59,718 INFO org.apache.hadoop.dfs.DataNode: Transmitted block blk_4522585614366970680 to /10.100.11.59:7277

On Jan 7, 2008, at 11:06 PM, Raghu Angadi wrote:

hmm... one possibility is that rest of the nodes were down. but the name node showed other nodes were up. If more than one datanodes were up, this indicates some bug.

One last thing : grep for this block id at 10.100.11.31. You might see some useful error message when the block was written.

thanks,
Raghu.

Chris Kline wrote:
Sure.  Here are 2 blocks.
----- blk_4522585614366970680
hadoop-rapleaf-namenode-tf1.rapleaf.com.log.2008-01-02:2008-01-02 17:06:17,058 INFO org.apache.hadoop.dfs.StateChange: BLOCK* NameSystem.allocateBlock: /data/hbase1/hregion_70236052/ compaction.dir/hregion_70236052/info/mapfiles/-1/index. blk_4522585614366970680 hadoop-rapleaf-namenode-tf1.rapleaf.com.log.2008-01-02:2008-01-02 17:06:17,062 INFO org.apache.hadoop.dfs.StateChange: BLOCK* NameSystem.addStoredBlock: blockMap updated: 10.100.11.31:7277 is added to blk_4522585614366970680
------ Note that the following is AFTER I restarted HDFS -------
hadoop-rapleaf-namenode-tf1.rapleaf.com.log.2008-01-03:2008-01-03 16:17:15,882 DEBUG org.apache.hadoop.dfs.StateChange: UnderReplicationBlocks.update blk_4522585614366970680 curReplicas 1 curExpectedReplicas 3 oldReplicas 0 oldExpectedReplicas 3 curPri 0 oldPri 3 hadoop-rapleaf-namenode-tf1.rapleaf.com.log.2008-01-03:2008-01-03 16:17:15,882 DEBUG org.apache.hadoop.dfs.StateChange: BLOCK* NameSystem.UnderReplicationBlock.update:blk_4522585614366970680 has only 1 replicas and need 3 replicas so is added to neededReplications at priority level 0 hadoop-rapleaf-namenode-tf1.rapleaf.com.log.2008-01-03:2008-01-03 16:17:58,808 DEBUG org.apache.hadoop.dfs.StateChange: BLOCK* NameSystem.UnderReplicationBlock.remove: Removing block blk_4522585614366970680 from priority queue 0 hadoop-rapleaf-namenode-tf1.rapleaf.com.log.2008-01-03:2008-01-03 16:17:58,808 DEBUG org.apache.hadoop.dfs.StateChange: BLOCK* NameSystem.pendingTransfer: blk_4522585614366970680 is removed from neededReplications to pendingReplications hadoop-rapleaf-namenode-tf1.rapleaf.com.log.2008-01-03:2008-01-03 16:17:58,808 INFO org.apache.hadoop.dfs.StateChange: BLOCK* NameSystem.pendingTransfer: ask 10.100.11.31:7277 to replicate blk_4522585614366970680 to datanode(s) 10.100.11.59:7277 10.100.11.64:7277 hadoop-rapleaf-namenode-tf1.rapleaf.com.log.2008-01-03:2008-01-03 16:17:59,716 DEBUG org.apache.hadoop.dfs.StateChange: BLOCK* NameSystem.blockReceived: blk_4522585614366970680 is received from 10.100.11.59:7277 hadoop-rapleaf-namenode-tf1.rapleaf.com.log.2008-01-03:2008-01-03 16:17:59,716 INFO org.apache.hadoop.dfs.StateChange: BLOCK* NameSystem.addStoredBlock: blockMap updated: 10.100.11.59:7277 is added to blk_4522585614366970680 hadoop-rapleaf-namenode-tf1.rapleaf.com.log.2008-01-03:2008-01-03 16:17:59,717 DEBUG org.apache.hadoop.dfs.StateChange: BLOCK* NameSystem.blockReceived: blk_4522585614366970680 is received from 10.100.11.64:7277 hadoop-rapleaf-namenode-tf1.rapleaf.com.log.2008-01-03:2008-01-03 16:17:59,717 INFO org.apache.hadoop.dfs.StateChange: BLOCK* NameSystem.addStoredBlock: blockMap updated: 10.100.11.64:7277 is added to blk_4522585614366970680

-Chris

We're hiring engineers.  $10,007 reward for referrals we hire.


Reply via email to