nsivabalan edited a comment on issue #2992:
URL: https://github.com/apache/hudi/issues/2992#issuecomment-847899578


   @ayush71994 : 
   1. Which config do you mean by "delete.duplicates"? Can you point me to 
the full config name at https://hudi.apache.org/docs/configurations.html? 
Do you mean 
https://hudi.apache.org/docs/configurations.html#INSERT_DROP_DUPS_OPT_KEY ? 
   2. With your insert_overwrite operation, does your new dataframe have 
duplicates that you wish to dedup before overwriting? 
   3. Can you confirm that the Hudi table already had data in the partitions 
that match the batch used for insert_overwrite?
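
   To make questions 1 and 2 concrete, here is a minimal sketch of the writer 
options being asked about. The option keys are the Hudi datasource write 
configs, with `hoodie.datasource.write.insert.drop.duplicates` being the key 
behind INSERT_DROP_DUPS_OPT_KEY. The helper name, table name, and column names 
are hypothetical, and whether the drop-duplicates flag has any effect together 
with insert_overwrite is exactly what is being asked, so treat this as an 
illustration rather than a recommended setup.

```python
def hudi_insert_overwrite_options(table_name, record_key, partition_field,
                                  precombine_field, drop_duplicates=False):
    """Build a dict of Hudi writer options for an insert_overwrite write.

    Hypothetical helper for illustration; only the option keys come from Hudi.
    """
    return {
        "hoodie.table.name": table_name,
        "hoodie.datasource.write.operation": "insert_overwrite",
        "hoodie.datasource.write.recordkey.field": record_key,
        "hoodie.datasource.write.partitionpath.field": partition_field,
        "hoodie.datasource.write.precombine.field": precombine_field,
        # Key behind INSERT_DROP_DUPS_OPT_KEY; Hudi expects a "true"/"false" string.
        "hoodie.datasource.write.insert.drop.duplicates": str(drop_duplicates).lower(),
    }


# Assumed table and column names, for illustration only.
options = hudi_insert_overwrite_options(
    table_name="my_table",
    record_key="uuid",
    partition_field="partition_path",
    precombine_field="ts",
    drop_duplicates=True,
)

# With a Spark session and a DataFrame `df`, the write would look roughly like:
#   df.dropDuplicates(["uuid"]).write.format("hudi") \
#       .options(**options).mode("append").save(base_path)
# where dropDuplicates dedups the incoming batch itself before the overwrite.
```

   Sharing the equivalent options from the actual job would answer question 1 
directly.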
   
   CC @satishkotha 


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.


