Kishen Das created HIVE-24275:
---------------------------------

             Summary: Introduce a configuration to delay the deletion of 
obsolete files by the Cleaner
                 Key: HIVE-24275
                 URL: https://issues.apache.org/jira/browse/HIVE-24275
             Project: Hive
          Issue Type: New Feature
            Reporter: Kishen Das


Whenever compaction happens, the cleaner immediately deletes older obsolete 
files. In certain cases it would be beneficial to retain these for certain 
period. For example : if you are serving the file metadata from cache and don't 
want to invalidate the cache during compaction because of performance reasons. 

For this purpose we should introduce a configuration 
hive.compactor.delayed.cleanup.enabled, which if enabled will delay the 
cleaning up obsolete files. There should be a separate configuration 
CLEANER_RETENTION_TIME to specific the duration till which we should retain 
these older obsolete files. 

It might be beneficial to have one more configuration to decide whether to 
retain files involved in an aborted transaction 
hive.compactor.aborted.txn.delayed.cleanup.enabled . 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to