Hi,

I am running 0.90 in a dev cluster with the cdh3b2 hadoop jar, and I frequently see the following in the reduce task log:

INFO [2011-01-24 15:27:39] (ExecUtil.java:258) - 2011-01-24 22:55:39,009 INFO com.carrieriq.m2m.platform.mmp3.output.DimensionMapper: Total requets=15523640 cache hit ratio=0.84543097 avg time=90.1465879780713
INFO [2011-01-24 15:27:39] (ExecUtil.java:258) - 2011-01-24 23:17:03,216 WARN org.apache.hadoop.hdfs.DFSClient: DFSOutputStream ResponseProcessor exception for block blk_8207645655823156697_2836871java.io.IOException: Bad response 1 for block blk_8207645655823156697_2836871 from datanode 10.202.50.71:50010
INFO [2011-01-24 15:27:39] (ExecUtil.java:258) - at org.apache.hadoop.hdfs.DFSClient$DFSOutputStream$ResponseProcessor.run(DFSClient.java:2497)
INFO [2011-01-24 15:27:39] (ExecUtil.java:258) -
INFO [2011-01-24 15:27:39] (ExecUtil.java:258) - 2011-01-24 23:17:03,217 WARN org.apache.hadoop.hdfs.DFSClient: Error Recovery for block blk_8207645655823156697_2836871 bad datanode[1] 10.202.50.71:50010
INFO [2011-01-24 15:27:39] (ExecUtil.java:258) - 2011-01-24 23:17:03,217 WARN org.apache.hadoop.hdfs.DFSClient: Error Recovery for block blk_8207645655823156697_2836871 in pipeline 10.202.50.78:50010, 10.202.50.71:50010: bad datanode 10.202.50.71:50010
INFO [2011-01-24 15:27:39] (ExecUtil.java:258) - 2011-01-24 23:17:03,252 INFO org.apache.hadoop.ipc.Client: Retrying connect to server: / 10.202.50.78:50020. Already tried 0 time(s).
INFO [2011-01-24 15:27:39] (ExecUtil.java:258) - 2011-01-24 23:27:27,931 WARN org.apache.hadoop.mapred.TaskRunner: Parent died. Exiting

HDFS-895 <https://issues.apache.org/jira/browse/HDFS-895> is in http://archive.cloudera.com/cdh/3/hadoop-0.20.2+320.releasenotes.html

Expert opinion on what I saw is appreciated.
