jjtjiang commented on issue #5777: URL: https://github.com/apache/hudi/issues/5777#issuecomment-1158452178
> And those records will be merged in the compaction process, which could justify the result you see, i.e., no duplication after a while (after the compaction). Without deduplication, this configuration uses the default value of hoodie.datasource.write.insert.drop.duplicates, which is false. When we read mor table and there is duplicate data in the log, will the data not be deduplicated by default? I understand that it will be deduplicated when reading, maybe I understand it wrong. I will modify this parameter and test it again. thanks very much. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
