nsivabalan edited a comment on issue #2992: URL: https://github.com/apache/hudi/issues/2992#issuecomment-847899578
@ayush71994 : 1. May I know which config you are referring to here "delete.duplicates"? Can you point me to full config from here https://hudi.apache.org/docs/configurations.html. Do you refer to https://hudi.apache.org/docs/configurations.html#INSERT_DROP_DUPS_OPT_KEY ? 2. And with your insert overwrite operation, does your new dataframe has duplicates and you wish to dedup before overwriting? 3. Can you confirm that hudi table had data in partitions matching data with batch used for insert_overwrite. CC @satishkotha -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [email protected]
