ConcurrentModificationException in org.apache.hadoop.ipc.Server.Responder
-------------------------------------------------------------------------

                 Key: HADOOP-2492
                 URL: https://issues.apache.org/jira/browse/HADOOP-2492
             Project: Hadoop
          Issue Type: Bug
          Components: ipc
    Affects Versions: 0.16.0
            Reporter: Devaraj Das
             Fix For: 0.16.0


I was running hadoop on 800 machines and after running a couple of jobs, and running 100% of the maps of the current job, the JobTracker stopped responding - *all* tasktrackers were lost ...

When I looked at the JT logs, these seemed alarming:

2007-12-26 19:18:30,185 WARN org.apache.hadoop.ipc.Server: Exception in Responder
java.util.ConcurrentModificationException

Following the above exception, I saw a whole lot of exceptions like:

2007-12-26 19:23:10,926 WARN org.apache.hadoop.ipc.Server: Call queue overflow discarding oldest call heartbeat([EMAIL PROTECTED], false, true, 1758) from 1.2.3.4:1234

From the number of exceptions to do with call queue overflow, it seemed like the jobtracker was not processing RPCs after it got the ConcurrentModificationException, and around that time the tasktrackers started getting timeouts on RPCs...

There were two occurrences of the ConcurrentModificationException but the first instance seemed to not have any effect on the call queue...
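
For readers unfamiliar with the exception itself: the report does not identify which collection the Responder was touching, so the following is only a generic sketch of how java.util.ConcurrentModificationException arises, not the Hadoop code. The class name CmeSketch and the "pending" list are hypothetical. The sketch modifies an ArrayList while a for-each loop is iterating it, which trips the iterator's fail-fast check; the same check can fire when a second thread modifies a non-thread-safe collection during iteration.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.ConcurrentModificationException;
import java.util.Iterator;
import java.util.List;

public class CmeSketch {

    // Hypothetical stand-in for a collection of pending calls; not Hadoop code.
    static List<String> newPending() {
        return new ArrayList<>(Arrays.asList("call-1", "call-2", "call-3"));
    }

    // Removes an element through the list while a for-each loop iterates it.
    // Returns true if the fail-fast iterator detected the modification.
    static boolean unsafeRemoveThrows() {
        List<String> pending = newPending();
        try {
            for (String call : pending) {
                if (call.equals("call-1")) {
                    pending.remove(call); // structural change behind the iterator's back
                }
            }
            return false;
        } catch (ConcurrentModificationException e) {
            return true;
        }
    }

    // Removes the same element through the iterator, which keeps its
    // bookkeeping consistent, so no exception is raised.
    static List<String> safeRemove() {
        List<String> pending = newPending();
        Iterator<String> it = pending.iterator();
        while (it.hasNext()) {
            if (it.next().equals("call-1")) {
                it.remove();
            }
        }
        return pending;
    }

    public static void main(String[] args) {
        System.out.println("unsafe removal threw CME: " + unsafeRemoveThrows());
        System.out.println("after safe removal: " + safeRemove());
    }
}
```

In a multi-threaded server the usual remedies are to guard iteration and modification with the same lock, or to use a concurrent collection; which one applies here depends on the actual cause, which this report leaves open.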