Agent side file expiration/removal
----------------------------------

                 Key: CHUKWA-204
                 URL: https://issues.apache.org/jira/browse/CHUKWA-204
             Project: Hadoop Chukwa
          Issue Type: New Feature
          Components: data collection, Data Processors
            Reporter: Jerome Boulon
            Priority: Critical


We have a shell script in order to expire files older than x days.
The idea is to move this to ChukwaAgent in order to reduce the number of daemon 
running and the overall complexity.

This could be done in a background/low priority Cleanup Thread inside the agent.
The input should be a configuration file with:
- path + fileName + regex (similar to the expiration in our Log4J appender)
- number of days (last accessed/modified? date)

The initial version could be as simple as this but a more advanced version will 
be needed.
The advanced version should be able to track data completeness based on 
CHUKWA-203 and only remove file that has been completely sent over or generate 
alerts.



-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to