Damn I found the answer to this problem, thanks to someone on the #hadoop IRC channel ...
It was a network check I added for our supervision ... therefore every 5 minutes the supervision connects to the datanode port to check if it is alive and then disconnects ... Sorry for the dumb problem ........ On 04/10/10 15:06, Arthur Caranta wrote: > Hi everyone. > > This is my first post on this mailing list and I hope I won't do any > mistakes ;) > > I work for a French telecom operator ad we have recently started a > Hadoop project. > > We currently have two clusters: > -a preproduction cluster running with 1 master plus 4 nodes > -a production cluster running with 1 master plus 24 nodes > > Both clusters are running Hadoop 0.20.2 and the jobs are running smoothly. > > However, on every datanodes of each cluster I receive, every 5 minutes, > I see this error : > 2010-10-04 14:31:46,564 ERROR > org.apache.hadoop.hdfs.server.datanode.DataNode: > DatanodeRegistration(xx.xx.xx.xx:50010, > storageID=DS-1295517999-xx.xx.xx.xx-50010-1280483437127, infoPort=50075, > ipcPort=50020):DataXceiver > java.io.EOFException > at java.io.DataInputStream.readShort(DataInputStream.java:298) > at > org.apache.hadoop.hdfs.server.datanode.DataXceiver.run(DataXceiver.java:79) > at java.lang.Thread.run(Thread.java:619) > > 2010-10-04 14:36:46,804 ERROR > org.apache.hadoop.hdfs.server.datanode.DataNode: > DatanodeRegistration(xx.xx.xx.xx:50010, > storageID=DS-1295517999-xx.xx.xx.xx-50010-1280483437127, infoPort=50075, > ipcPort=50020):DataXceiver > java.io.EOFException > at java.io.DataInputStream.readShort(DataInputStream.java:298) > at > org.apache.hadoop.hdfs.server.datanode.DataXceiver.run(DataXceiver.java:79) > at java.lang.Thread.run(Thread.java:619) > > > And I can't find anything on the web ... > > Anyone has a clue ? > > Best Regards, > -- ------------------------------ Arthur CARANTA [email protected] http://www.caranta.com ------------------------------
