[
https://issues.apache.org/jira/browse/MAPREDUCE-2589?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sherry Chen updated MAPREDUCE-2589:
-----------------------------------
Attachment: MAPREDUCE-2589.patch
Attached fix will delete leftover user logs which are last modified 7 days ago
and are not for any running jobs when TaskTracker restarts.
UserLogCleaner still
takes care of normal user logs cleanup.
DEFAULT_USER_LOG_RETAIN_HOURS is 1 day, so I added new
DEFAULT_USER_LOG_RETAIN_MAX_HOURS as 7 days.
I would like to have a separate JIRA to handle cleanup old userlogs based on
userlog disk space water mark since which may deal with job configuration
settings
change.
> TaskTracker not purging userlog directories
> -------------------------------------------
>
> Key: MAPREDUCE-2589
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2589
> Project: Hadoop Map/Reduce
> Issue Type: Bug
> Components: tasktracker
> Affects Versions: 0.20.205.0
> Environment: 0.20.205
> Reporter: Sherry Chen
> Assignee: Sherry Chen
> Priority: Minor
> Attachments: MAPREDUCE-2589.patch, cleanup_userlogs.py
>
>
> UserLogCleaner is not robust. Leftover userlogs after a restart sometimes
> have to be manually
> cleaned. Things can accumulate over a period of time.
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira