lw309637554 commented on pull request #2196:
URL: https://github.com/apache/hudi/pull/2196#issuecomment-722512937


   @satishkotha 
   A. Thanks  so much. This pr need to solved the issue with better approach. 
   Now I am more clear about overwrite semantic between table.overwrite and  
spark sql overwrite for hudi.
   
   B. Also spark sql for hudi overwrite should have the ability just like spark 
sql 、hive 、 delta lake.
   these engine have three mode for overwrite about partition:
   1. Dynamic Partition : delete all partition data ,and the insert the new 
data for different 
   2. Static partition: just overwrite the partition which is user specified
   3. Mixed partition: mixed of 1 and 2
   more detail in : 
   
https://spark.apache.org/docs/3.0.0-preview/sql-ref-syntax-dml-insert-overwrite-table.html
   https://www.programmersought.com/article/47155360487/
   
   C. our plan
   1. Now spark sql for hudi overwrite is Dynamic Partition. I will resolved it 
in this issue HUDI-1349, first support delete all partition in 
[HUDI-1350](https://issues.apache.org/jira/browse/HUDI-1350) , then land this 
issue. (Just like @satishkotha 's Suggestion)
   2. Now spark sql for hudi does not support "Static partition" mode, will 
then land it in  [HUDI-1374](https://issues.apache.org/jira/browse/HUDI-1374)
   3. future support "Mixed partition" mode
   
   cc @n3nash @vinothchandar  please help to review  if  the plan about spark 
sql for hudi  overwirte is suitable. 
   It is also possible that my understanding is inappropriate 


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]


Reply via email to