> On Apr 24, 2018, at 5:01 PM, Greg Stein <[email protected]> wrote:
>
> Let's go back to the start: stuff older than six months will be deleted.
> What could possibly need to be retained?
- Not every job runs every day. Some are extremely situational.
- Some users might have specifically marked certain data to be retained
for very specific reasons.
I know in my case I marked some logs not to be deleted because I was
using them to debug the systemic Jenkins build node crashes. I want to keep the
data to see whether the usage numbers, etc., go down over time.
So yes, there may be some value to some of that data that will not be
obvious to an outside observer.
> Assume all jobs will be touched.
… which is why a directory listing of just the base directory would be
useful: it would show who needs to look. If INFRA is unwilling to provide that
data, then keep any directories that reference:
- precommit
- hadoop
- yarn
- hdfs
- mapreduce
- hbase
- yetus
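
To make the request above concrete, here is a rough sketch of the keep/delete
decision. It is only an illustration, not anything INFRA actually runs: the
function name, the 182-day approximation of "six months", and the choice to
match keywords against directory names case-insensitively are all my
assumptions.

```python
from datetime import datetime, timedelta

# Keywords from the list above. Matching them against the directory
# name is an assumption about what "reference" means here.
KEEP_KEYWORDS = ("precommit", "hadoop", "yarn", "hdfs", "mapreduce", "hbase", "yetus")

# Assumed approximation of "six months"; the exact cutoff is not specified.
MAX_AGE = timedelta(days=182)


def should_keep(dir_name, last_modified, now):
    """Return True if a directory should survive the cleanup (hypothetical helper)."""
    # Keep anything whose name mentions one of the listed projects.
    if any(keyword in dir_name.lower() for keyword in KEEP_KEYWORDS):
        return True
    # Otherwise keep it only if it is newer than the cutoff.
    return now - last_modified <= MAX_AGE
```

For example, with `now = datetime(2018, 4, 24)` and a made-up directory last
touched on 2017-01-01, a name such as `PreCommit-HDFS-Build` would be kept,
while `some-other-job` would be eligible for deletion.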
Thanks!