Final map task gets stuck ------------------------- Key: HADOOP-707 URL: http://issues.apache.org/jira/browse/HADOOP-707 Project: Hadoop Issue Type: Bug Components: mapred Affects Versions: 0.8.0 Environment: using latest trunk Reporter: Johan Oskarson Priority: Critical
I've seen numerous jobs lately where the final map task gets stuck, never finishing. The jobtracker doesn't reassign the task. A restart of the tasktracker solves the issue and the job can finish. In the web interface it turns up as: task_0028_m_000534_0 node17.herd1 RUNNING 0.00% 10-Nov-2006 12:21:12 10-Nov-2006 12:22:19 (1mins, 6sec) Task failed to report status for 604 seconds. Killing. Only exception I find in that tasktracker log is this (a few times): java.nio.channels.ClosedChannelException at sun.nio.ch.SocketChannelImpl.ensureWriteOpen(SocketChannelImpl.java:125) at sun.nio.ch.SocketChannelImpl.write(SocketChannelImpl.java:294) at org.apache.hadoop.ipc.SocketChannelOutputStream.flushBuffer(SocketChannelOutputStream.java:108) at org.apache.hadoop.ipc.SocketChannelOutputStream.write(SocketChannelOutputStream.java:89) at java.io.BufferedOutputStream.flushBuffer(BufferedOutputStream.java:65) at java.io.BufferedOutputStream.flush(BufferedOutputStream.java:123) at java.io.DataOutputStream.flush(DataOutputStream.java:106) at org.apache.hadoop.ipc.Server$Handler.run(Server.java:532) -- This message is automatically generated by JIRA. - If you think it was sent incorrectly contact one of the administrators: http://issues.apache.org/jira/secure/Administrators.jspa - For more information on JIRA, see: http://www.atlassian.com/software/jira