[ https://issues.apache.org/jira/browse/HADOOP-3915?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Andreas Kostyrka updated HADOOP-3915:
-------------------------------------

    Attachment: hadoop-hadoop-jobtracker-ec2-67-202-58-97.compute-1.amazonaws.com.log

This is the JobTracker log. In it you can see the attempts that I made to use
hadoop job -kill-task.

Furthermore, you'll notice that the reducers in question get started, but then 
disappear:

[EMAIL PROTECTED]:/tmp/bug$ grep tip_200808070013_0001_r_000020 hadoop-hadoop-jobtracker-ec2-67-202-58-97.compute-1.amazonaws.com.log
2008-08-07 00:15:07,184 INFO org.apache.hadoop.mapred.JobTracker: Adding task 'task_200808070013_0001_r_000020_0' to tip tip_200808070013_0001_r_000020, for tracker 'tracker_ec2-75-101-217-112.compute-1.amazonaws.com:localhost/127.0.0.1:53783'
2008-08-07 05:51:35,978 INFO org.apache.hadoop.mapred.JobTracker: Kill task attempt failed since task tip_200808070013_0001_r_000020 was not found
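
The same check can be run over every reduce TIP at once instead of grepping
one id at a time. Below is a minimal sketch in Python, assuming the two
JobTracker message formats shown above and an unwrapped log with one record
per line. The names tip_events and lost_tips are hypothetical helpers, not
part of Hadoop.

```python
import re
from collections import defaultdict

# Matches reduce TIP ids such as tip_200808070013_0001_r_000020.
TIP_RE = re.compile(r"tip_\d+_\d+_r_\d+")


def tip_events(lines):
    """Group log lines by the reduce TIP id they mention."""
    events = defaultdict(list)
    for line in lines:
        # set() avoids adding a line twice if it repeats the same id.
        for tip in set(TIP_RE.findall(line)):
            events[tip].append(line.rstrip("\n"))
    return dict(events)


def lost_tips(events):
    """Return TIP ids with at least one 'was not found' kill failure."""
    return sorted(
        tip
        for tip, tip_lines in events.items()
        if any("was not found" in entry for entry in tip_lines)
    )
```

Feeding it the open log file, for example
lost_tips(tip_events(open(path))), would list the TIPs the JobTracker
could no longer find when a kill was attempted.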

The last mapper finished at or before about 3:30 GMT (unless I overlooked
one), and the reducers have been logging a long repetition of:

2008-08-07 07:02:39,191 INFO org.apache.hadoop.mapred.ReduceTask: task_200808070013_0001_r_000013_0 Got 0 known map output location(s); scheduling...
2008-08-07 07:02:39,191 INFO org.apache.hadoop.mapred.ReduceTask: task_200808070013_0001_r_000013_0 Scheduled 0 of 0 known outputs (0 slow hosts and 0 dup hosts)
2008-08-07 07:02:44,200 INFO org.apache.hadoop.mapred.ReduceTask: task_200808070013_0001_r_000013_0 Need 67 map output(s)
2008-08-07 07:02:44,200 INFO org.apache.hadoop.mapred.ReduceTask: task_200808070013_0001_r_000013_0: Got 0 new map-outputs & 0 obsolete map-outputs from tasktracker and 0 map-outputs from previous failures
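
To see which reducers are stuck in this loop, the "Need N map output(s)"
lines can be tallied per task attempt. Below is a minimal sketch in Python,
assuming the ReduceTask message format shown above and an unwrapped task log
with one record per line. The function name stalled_reducers and the
min_repeats threshold are hypothetical choices for illustration.

```python
import re

# Captures the task attempt id and the number of map outputs still needed.
NEED_RE = re.compile(r"(task_\d+_\d+_r_\d+_\d+) Need (\d+) map output\(s\)")


def stalled_reducers(lines, min_repeats=3):
    """Return {task_id: needed} for tasks whose last `min_repeats`
    'Need N' reports are all identical, i.e. no shuffle progress."""
    history = {}
    for line in lines:
        match = NEED_RE.search(line)
        if match:
            history.setdefault(match.group(1), []).append(int(match.group(2)))
    return {
        task: counts[-1]
        for task, counts in history.items()
        if len(counts) >= min_repeats and len(set(counts[-min_repeats:])) == 1
    }
```

A reducer that is still fetching map outputs reports a shrinking count and
is therefore not flagged; one that repeats the same count indefinitely is.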


> reducers hang, jobtracker completely losing track of them.
> ----------------------------------------------------------
>
>                 Key: HADOOP-3915
>                 URL: https://issues.apache.org/jira/browse/HADOOP-3915
>             Project: Hadoop Core
>          Issue Type: Bug
>    Affects Versions: 0.17.1
>         Environment: EC2, Debian Etch  (but not the ec2-contrib stuff)
> streaming.jar
>            Reporter: Andreas Kostyrka
>             Fix For: 0.17.2
>
>         Attachments: hadoop-hadoop-jobtracker-ec2-67-202-58-97.compute-1.amazonaws.com.log
>
>
> I just noticed the following curious situation:
> -) 18 of 22 reducers have been waiting for 3 hours or so at 0.01MB/s with
> no progress.
> -) hadoop job -kill-task does not work on the ids shown.
> -) Killing all reduce work tasks (the spawned Python processes, not the Java
> TaskTracker$Child) is completely ignored by the JobTracker, which still
> shows them as running.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
