fengjian428 commented on issue #7745: URL: https://github.com/apache/hudi/issues/7745#issuecomment-1404623769
> the new insert records with the same key could be duplicated actually even if you are using the UPSERT operation let's say record-key1 does not exist in the current table, writer1 and writer2 both have a record with key 1 arriving at the same time record-key1 -> writer1-> tag->not exist->assign to a filegroup to write record-key1 -> writer2->tag->not exist->assign to a filegroup to write writer1&writer2 cannot guarantee the two records with the same key can be assigned to the same filegroup since they don't know each other -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
