I had 20 mappers in parallel reading 20 gz files and each file around 30-40MB data over 5 hadoop nodes and then writing to the analytics database. Almost midway it started to get this error:
2012-04-26 16:13:53,723 [Thread-8] INFO org.apache.hadoop.hdfs.DFSClient - Exception in createBlockOutputStream 17.18.62.192:50010java.io.IOException: Bad connect ack with firstBadLink as 17.18.62.191:50010 I am trying to look at the logs but doesn't say much. What could be the reason? We are in pretty closed reliable network and all machines are up.