satish created HUDI-1072:
----------------------------
Summary: Reader changes to support clustering and insert overwrite
Key: HUDI-1072
URL: https://issues.apache.org/jira/browse/HUDI-1072
Project: Apache Hudi
Issue Type: Sub-task
Reporter: satish
* Add metadata to track ‘replaced’ files. Replaced files are essentially file
groups to be ignored. For ‘insert overwrite’ this is all existing files in the
partition overwritten. For ‘clustering’, this is all file groups that are
merged into a new set of file groups.
* Change Views to ignore replaced files (AbstractTableFileSystemView and all
subclasses)
* Change cleaner to delete data files that have been replaced (Introduce a new
policy?)
* Change archival to not delete active commits that have this special metadata
if corresponding data files are not deleted.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)