[ 
https://issues.apache.org/jira/browse/HADOOP-4372?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12711523#action_12711523
 ] 

Devaraj Das commented on HADOOP-4372:
-------------------------------------

I committed this to the 0.20 branch as well. There have been problems such as 
job submission taking a long time when the number of files in the history 
folder is very large. This patch introduces an API that speeds up obtaining 
the history file for a new job (previously this required a scan of the 
history folder, which is costly when the number of files is large).
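
To illustrate the difference in cost, here is a rough sketch and not the 
actual patch. The function names and the `<job_id>_<user>_<job_name>` file 
naming are hypothetical, chosen only for the example; they are not Hadoop's 
API or its real history file layout. The first function lists the whole 
folder, so its cost grows with the number of files; the second builds the 
path for a new job directly and never lists the folder.

```python
import os


def find_history_file_by_scan(history_dir, job_id):
    """Scan approach: list every file in the folder and match on the job id.

    The cost is proportional to the number of files in history_dir.
    """
    prefix = job_id + "_"  # hypothetical naming scheme
    for name in os.listdir(history_dir):
        if name.startswith(prefix):
            return os.path.join(history_dir, name)
    return None


def history_file_for_new_job(history_dir, job_id, user, job_name):
    """Direct approach: build the path from known parts, with no listing.

    The cost does not depend on how many files the folder holds.
    """
    return os.path.join(history_dir, f"{job_id}_{user}_{job_name}")
```

For a job that is known to be new there is no older file to look for, so 
the directory listing in the first function can be skipped entirely.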

> Improve the way the job history files are managed during job recovery
> ---------------------------------------------------------------------
>
>                 Key: HADOOP-4372
>                 URL: https://issues.apache.org/jira/browse/HADOOP-4372
>             Project: Hadoop Core
>          Issue Type: Improvement
>          Components: mapred
>            Reporter: Amar Kamat
>            Assignee: Amar Kamat
>             Fix For: 0.21.0
>
>         Attachments: HADOOP-4372-v1.4.patch, HADOOP-4372-v1.patch, 
> HADOOP-4372-v3.0.patch, HADOOP-4372-v3.1.patch
>
>
> Today we use the _.recover_ technique to handle the job history files when 
> the jobtracker restarts. The comment 
> [here|https://issues.apache.org/jira/browse/HADOOP-3245?focusedCommentId=12629080#action_12629080]
>  proposes a better way to handle the files. 

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
