Sometimes, Reduce tasks hang, State is unassigned
-------------------------------------------------

                 Key: HADOOP-5407
                 URL: https://issues.apache.org/jira/browse/HADOOP-5407
             Project: Hadoop Core
          Issue Type: Bug
          Components: mapred
    Affects Versions: 0.19.0
            Reporter: ZhuGuanyin


Hi, all

After our cluster has been running for a long time, some reduce tasks on some
tasktrackers hang, with their state shown as UNASSIGNED. From then on, all
reduce tasks on these tasktrackers hang.

If we kill a hung reduce task, the task attempt is rescheduled to the same
tasktracker and the new attempt hangs as well. If we fail it instead, it goes
to another tasktracker and runs successfully.

A tasktracker that has a hung reduce task still receives new reduce tasks, but
those reduce tasks also hang forever.

After we reboot the tasktracker machine, reduce tasks no longer hang.


-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.