[
https://issues.apache.org/jira/browse/MAPREDUCE-4680?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13806966#comment-13806966
]
Sandy Ryza commented on MAPREDUCE-4680:
---------------------------------------
Makes sense. +1, will commit this later today.
> Job history cleaner should only check timestamps of files in old enough
> directories
> -----------------------------------------------------------------------------------
>
> Key: MAPREDUCE-4680
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-4680
> Project: Hadoop Map/Reduce
> Issue Type: Bug
> Components: jobhistoryserver
> Affects Versions: 2.0.0-alpha
> Reporter: Sandy Ryza
> Assignee: Robert Kanter
> Attachments: MAPREDUCE-4680.patch, MAPREDUCE-4680.patch,
> MAPREDUCE-4680.patch, MAPREDUCE-4680.patch, MAPREDUCE-4680.patch
>
>
> Job history files are stored in yyyy/mm/dd folders. Currently, the job
> history cleaner checks the modification date of each file in every one of
> these folders to see whether it's past the maximum age. The load on HDFS
> could be reduced by only checking the ages of files in directories that are
> old enough, as determined by their name.
--
This message was sent by Atlassian JIRA
(v6.1#6144)