leesf commented on PR #7365:
URL: https://github.com/apache/hudi/pull/7365#issuecomment-1343707001

   > we have two operations relating to insert_overwrite. 1: 
insert_overwrite_table 2: insert_overwrite.
   > 
   > spark-ds writes supports both operations. insert_overwrite_table will 
override entire table. while insert_overwrite will overwrite only matching 
partitions.
   > 
   > guess in spark-sql, we supported only insert_overwrite. not sure if we can 
revert the behavior. May be we should consider adding a new write operation in 
spark-sql for this.
   
   @nsivabalan hi, here are my two cents: `insert overwrite xxx values(xx,xxx)` 
has very clear semantics, it means overwrite the entire table, `insert 
overwrite xx partition(xx) values(xx,xxx)` means insert overwrite partitions, 
but hudi handles overwrite partitions for `overwrite table`, which is a 
definite bug and i do not think we need to introduce a new operation for it. 


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to