satish created HUDI-868:
---------------------------
Summary: Insert Overwrite API
Key: HUDI-868
URL: https://issues.apache.org/jira/browse/HUDI-868
Project: Apache Hudi (incubating)
Issue Type: Improvement
Reporter: satish
Use cases:
- Tables where the majority of records change every cycle. For these, it is
likely more efficient to write new data than to do upserts.
- Operational tasks to fix a specific corrupted partition. We can do 'insert
overwrite' on that partition with records from the source. This can be much
faster than restore and replay for some data sources.
The functionality will be similar to Hive's definition of 'insert overwrite'.
But doing this in Hoodie will provide better isolation between writers and readers.
I can share possible implementation choices and some nuances if the community
thinks this is a useful feature to add.
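
To make the intended semantics concrete, here is a minimal in-memory sketch
contrasting upsert with a partition-level 'insert overwrite'. It is a toy model
and not Hudi code: the Table layout and the functions upsert and
insert_overwrite are hypothetical names chosen for illustration. Each write
returns a new snapshot, which is only a stand-in for the reader isolation
mentioned above.

```python
from typing import Dict, List

Record = Dict[str, str]
# Hypothetical toy layout: partition path -> record key -> record.
Table = Dict[str, Dict[str, Record]]


def upsert(table: Table, partition: str, records: List[Record]) -> Table:
    """Merge incoming records by key; rows not in the input are kept."""
    merged = dict(table.get(partition, {}))
    for rec in records:
        merged[rec["key"]] = rec
    # Return a new snapshot so readers of the old one are unaffected.
    return {**table, partition: merged}


def insert_overwrite(table: Table, partition: str, records: List[Record]) -> Table:
    """Replace the whole partition with the input; other partitions are untouched."""
    replaced = {rec["key"]: rec for rec in records}
    return {**table, partition: replaced}


table: Table = {
    "2020-05-01": {
        "a": {"key": "a", "value": "stale"},
        "b": {"key": "b", "value": "corrupt"},
    },
    "2020-05-02": {"c": {"key": "c", "value": "ok"}},
}
source = [{"key": "a", "value": "fresh"}]

after_upsert = upsert(table, "2020-05-01", source)
after_overwrite = insert_overwrite(table, "2020-05-01", source)
```

In this sketch the upsert leaves the corrupt record "b" in place, while the
overwrite drops it because the partition is rebuilt only from the source
records. The original table object is not modified by either call.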
--
This message was sent by Atlassian Jira
(v8.3.4#803005)