hililiwei opened a new pull request #4316:
URL: https://github.com/apache/iceberg/pull/4316


   After we update the data via `upsert`, if we use the field not in the 
`identifierFieldIds` to query the data, we may get inaccurate result.
   
   Because the metrics values of non-identifierFieldIds in the manifest file 
are new, and these new data may be hit by the `row filter`, resulting in the 
old data being updated, but still placed in the result set.
   
   In this PR, try to come up with a solution. When filtering the delete 
manifest file, if the `identifierFieldIds` is not empty, only the fields in the 
`identifierFieldIds` are retained in the `row filter`, so that the fields in 
the non-identifierFieldIds are always processed as True.
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]



---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to