nsivabalan commented on issue #8034: URL: https://github.com/apache/hudi/issues/8034#issuecomment-1453965150
I can explain whats happening under the hood. not sure how we can fix it properly. Might need to think deep. After step 8 above, delete of id=1 goes into a log file in hudi_table2. So, if you do a sanpshot read from table2, you will not see id=1 record. But if you do an index look up, it might show as though id=1 belongs to hudi_table2 untill compaction kicks in. So, during step9, the merge into results in an index lookup (when not matched), both id=1 and id=2 are seen as valid records from hudi_table2. and so it does not re-insert anything. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
