hililiwei opened a new pull request #4316: URL: https://github.com/apache/iceberg/pull/4316
After we update the data via `upsert`, if we use the field not in the `identifierFieldIds` to query the data, we may get inaccurate result. Because the metrics values of non-identifierFieldIds in the manifest file are new, and these new data may be hit by the `row filter`, resulting in the old data being updated, but still placed in the result set. In this PR, try to come up with a solution. When filtering the delete manifest file, if the `identifierFieldIds` is not empty, only the fields in the `identifierFieldIds` are retained in the `row filter`, so that the fields in the non-identifierFieldIds are always processed as True. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
