[ https://issues.apache.org/jira/browse/HADOOP-4372?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12711523#action_12711523 ]
Devaraj Das commented on HADOOP-4372: ------------------------------------- I committed this to 0.20 branch as well. There have been problems like job submission taking a long time when the number of files in the history folder are too large. This patch introduced an API that would fasten the process of obtaining a history file for a new job (earlier it used to do a scan of the history folder which is costly when the number of files are large). > Improve the way the job history files are managed during job recovery > --------------------------------------------------------------------- > > Key: HADOOP-4372 > URL: https://issues.apache.org/jira/browse/HADOOP-4372 > Project: Hadoop Core > Issue Type: Improvement > Components: mapred > Reporter: Amar Kamat > Assignee: Amar Kamat > Fix For: 0.21.0 > > Attachments: HADOOP-4372-v1.4.patch, HADOOP-4372-v1.patch, > HADOOP-4372-v3.0.patch, HADOOP-4372-v3.1.patch > > > Today we use the _.recover_ technique to handle the job history files when > the jobtracker restarts. The comment > [here|https://issues.apache.org/jira/browse/HADOOP-3245?focusedCommentId=12629080#action_12629080] > proposes a better way to handle the files. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.