LiangDai-Mars opened a new pull request, #7033:
URL: https://github.com/apache/paimon/pull/7033

   ### Purpose
   
   This pull request aims to fix an issue in the incremental scan where 
concurrent writes to the same key could finally return wrong merged result. 
Since records included in merging set are in non-deterministic sequence numbers 
 could result in unstable merge order, which the incremental scan heavily 
relies on, would lead to incorrect final results. The fix ensures the correct 
result by re-sorting the data in the getResult stage of the split diff read 
process.
   
   ### Tests
   
   A new integration test, testIncrementScanModeWithInsertOverwrite, has been 
added to BatchFileStoreITCase.java. This test simulates the scenario with 
multiple INSERT OVERWRITE operations with different seq. number on the same 
key, and then verifies that the diff mode of incremental scan correctly 
identifies the changes, ensuring the fix is effective.
   
   ### API and Format
   
   No API or format changes. This change only involves an internal logic 
correction and does not affect any external APIs, data storage formats, or 
configuration files.
   
   ### Documentation
   
   
   This change is an internal bug fix and does not require updates to 
user-facing documentation.
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to