[
https://issues.apache.org/jira/browse/HADOOP-2141?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12567071#action_12567071
]
Amar Kamat commented on HADOOP-2141:
------------------------------------
Dumping some logs from my recent runs
{noformat}
2008-02-08 15:09:19,459 INFO org.apache.hadoop.mapred.TaskInProgress: Error
from task_200802080908_0005_r_001459_0: Task task_200802080908_0005_r_001459_0
failed to report status for 605 seconds. Killing!
2008-02-08 15:09:19,460 INFO org.apache.hadoop.mapred.JobTracker: Removed
completed task 'task_200802080908_0005_r_001459_0' from
'tracker_gs205019.inktomisearch.com:gs205019.inktomisearch.com/76.13.184.103:58495'
2008-02-08 15:09:19,474 INFO org.apache.hadoop.mapred.TaskRunner: Discarded
output of task 'task_200802080908_0005_r_001459_0' -
hdfs://gs205514.inktomisearch.com:57972/user/amarrk/output/_task_200802080908_0005_r_001459_0
2008-02-08 15:09:19,513 INFO org.apache.hadoop.mapred.JobInProgress: Choosing
normal task tip_200802080908_0005_r_001459
2008-02-08 15:09:19,514 INFO org.apache.hadoop.mapred.JobTracker: Adding task
'task_200802080908_0005_r_001459_1' to tip tip_200802080908_0005_r_001459, for
tracker
'tracker_gs205440.inktomisearch.com:gs205440.inktomisearch.com/76.13.187.49:55843'
2008-02-08 15:09:19,517 INFO org.apache.hadoop.mapred.JobInProgress: Choosing
speculative task tip_200802080908_0005_r_001459
2008-02-08 15:09:19,517 INFO org.apache.hadoop.mapred.JobTracker: Adding task
'task_200802080908_0005_r_001459_2' to tip tip_200802080908_0005_r_001459, for
tracker
'tracker_gs205190.inktomisearch.com:gs205190.inktomisearch.com/76.13.185.109:58837'
{noformat}
The main task and speculative one got executed back to back.
> speculative execution start up condition based on completion time
> -----------------------------------------------------------------
>
> Key: HADOOP-2141
> URL: https://issues.apache.org/jira/browse/HADOOP-2141
> Project: Hadoop Core
> Issue Type: Improvement
> Components: mapred
> Affects Versions: 0.15.0
> Reporter: Koji Noguchi
> Assignee: Arun C Murthy
> Fix For: 0.17.0
>
>
> We had one job with speculative execution hang.
> 4 reduce tasks were stuck with 95% completion because of a bad disk.
> Devaraj pointed out
> bq . One of the conditions that must be met for launching a speculative
> instance of a task is that it must be at least 20% behind the average
> progress, and this is not true here.
> It would be nice if speculative execution also starts up when tasks stop
> making progress.
> Devaraj suggested
> bq. Maybe, we should introduce a condition for average completion time for
> tasks in the speculative execution check.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.