Reducers stuck in 'sort' ------------------------ Key: HADOOP-1174 URL: https://issues.apache.org/jira/browse/HADOOP-1174 Project: Hadoop Issue Type: Bug Components: mapred Affects Versions: 0.12.2 Reporter: Arun C Murthy
A couple of reduces seem stuck on a small 20-node cluster in the 'sort' phase for almost an hour: TaskTracker logs: ------------------------ 2007-03-28 14:13:46,471 INFO org.apache.hadoop.mapred.TaskTracker: task_0002_r_000005_0 0.33333334% reduce > sort 2007-03-28 14:13:46,478 INFO org.apache.hadoop.mapred.TaskTracker: task_0002_r_000009_0 0.33333334% reduce > sort 2007-03-28 14:13:47,476 INFO org.apache.hadoop.mapred.TaskTracker: task_0002_r_000005_0 0.33333334% reduce > sort 2007-03-28 14:13:47,483 INFO org.apache.hadoop.mapred.TaskTracker: task_0002_r_000009_0 0.33333334% reduce > sort ... ... ... 2007-03-28 15:06:04,376 INFO org.apache.hadoop.mapred.TaskTracker: task_0002_r_000005_0 0.33333334% reduce > sort 2007-03-28 15:06:04,411 INFO org.apache.hadoop.mapred.TaskTracker: task_0002_r_000009_0 0.33333334% reduce > sort 2007-03-28 15:06:05,379 INFO org.apache.hadoop.mapred.TaskTracker: task_0002_r_000005_0 0.33333334% reduce > sort 2007-03-28 15:06:05,414 INFO org.apache.hadoop.mapred.TaskTracker: task_0002_r_000009_0 0.33333334% reduce > sort Eventually the JobTracker declared the same TT 'lost' (presumably for no heartbeats): 2007-03-28 15:18:20,341 INFO org.apache.hadoop.mapred.JobTracker: Lost tracker 'tracker_XXX:9020' -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.