satish created HUDI-1072:
----------------------------

             Summary: Reader changes to support clustering and insert overwrite
                 Key: HUDI-1072
                 URL: https://issues.apache.org/jira/browse/HUDI-1072
             Project: Apache Hudi
          Issue Type: Sub-task
            Reporter: satish


* Add metadata to track ‘replaced’ files. Replaced files are essentially file 
groups to be ignored. For ‘insert overwrite’ this is all existing files in the 
partition overwritten. For ‘clustering’, this is all file groups that are 
merged into a new set of file groups.
* Change Views to ignore replaced files (AbstractTableFileSystemView and all 
subclasses)
* Change cleaner to delete data files that have been replaced (Introduce a new 
policy?)
* Change archival to not delete active commits that have this special metadata 
if corresponding data files are not deleted.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to