[ 
https://issues.apache.org/jira/browse/HADOOP-5394?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12679949#action_12679949
 ] 

Devaraj Das commented on HADOOP-5394:
-------------------------------------

I suggest we move to the model of moving to the model where the restart count 
is based on the number of times the JobTracker got restarted rather than 
associating the count with a per job restart (as it is today). The 
restart-count read/update could be the first thing that the JT ever does as 
soon as it starts up.

> JobTracker might schedule 2 attempts of the same task with the same attempt 
> id across restarts
> ----------------------------------------------------------------------------------------------
>
>                 Key: HADOOP-5394
>                 URL: https://issues.apache.org/jira/browse/HADOOP-5394
>             Project: Hadoop Core
>          Issue Type: Bug
>          Components: mapred
>            Reporter: Amar Kamat
>            Assignee: Amar Kamat
>            Priority: Critical
>
> This can happen when the jobtracker gets restarted more than once. In such 
> cases, the jobtracker depends on the jobhistory file for the next restart 
> count. If the new restart-count is not flushed to the file then there is a 
> fair chance that upon next restart, the jobtracker might schedule a new 
> attempt with an existing id. This can cause problems not only with the 
> side-effect files but also can cause the jobtracker to be in an inconsistent 
> state.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to