leesf commented on PR #7365: URL: https://github.com/apache/hudi/pull/7365#issuecomment-1343707001
> we have two operations relating to insert_overwrite. 1: insert_overwrite_table 2: insert_overwrite. > > spark-ds writes supports both operations. insert_overwrite_table will override entire table. while insert_overwrite will overwrite only matching partitions. > > guess in spark-sql, we supported only insert_overwrite. not sure if we can revert the behavior. May be we should consider adding a new write operation in spark-sql for this. @nsivabalan hi, here are my two cents: `insert overwrite xxx values(xx,xxx)` has very clear semantics, it means overwrite the entire table, `insert overwrite xx partition(xx) values(xx,xxx)` means insert overwrite partitions, but hudi handles overwrite partitions for `overwrite table`, which is a definite bug and i do not think we need to introduce a new operation for it. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
