Reducers stuck in 'sort'
------------------------

                 Key: HADOOP-1174
                 URL: https://issues.apache.org/jira/browse/HADOOP-1174
             Project: Hadoop
          Issue Type: Bug
          Components: mapred
    Affects Versions: 0.12.2
            Reporter: Arun C Murthy


A couple of reduces seem stuck on a small 20-node cluster in the 'sort' phase 
for almost an hour:

TaskTracker logs:
------------------------
2007-03-28 14:13:46,471 INFO org.apache.hadoop.mapred.TaskTracker: 
task_0002_r_000005_0 0.33333334% reduce > sort
2007-03-28 14:13:46,478 INFO org.apache.hadoop.mapred.TaskTracker: 
task_0002_r_000009_0 0.33333334% reduce > sort
2007-03-28 14:13:47,476 INFO org.apache.hadoop.mapred.TaskTracker: 
task_0002_r_000005_0 0.33333334% reduce > sort
2007-03-28 14:13:47,483 INFO org.apache.hadoop.mapred.TaskTracker: 
task_0002_r_000009_0 0.33333334% reduce > sort
...
...
...
2007-03-28 15:06:04,376 INFO org.apache.hadoop.mapred.TaskTracker: 
task_0002_r_000005_0 0.33333334% reduce > sort
2007-03-28 15:06:04,411 INFO org.apache.hadoop.mapred.TaskTracker: 
task_0002_r_000009_0 0.33333334% reduce > sort
2007-03-28 15:06:05,379 INFO org.apache.hadoop.mapred.TaskTracker: 
task_0002_r_000005_0 0.33333334% reduce > sort
2007-03-28 15:06:05,414 INFO org.apache.hadoop.mapred.TaskTracker: 
task_0002_r_000009_0 0.33333334% reduce > sort


Eventually the JobTracker declared the same TT 'lost' (presumably for no 
heartbeats):

2007-03-28 15:18:20,341 INFO org.apache.hadoop.mapred.JobTracker: Lost tracker 
'tracker_XXX:9020'


-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to