Gabor Kaszab created IMPALA-12599:
-------------------------------------

             Summary: Support equality delete files that don't contain the 
partition values
                 Key: IMPALA-12599
                 URL: https://issues.apache.org/jira/browse/IMPALA-12599
             Project: IMPALA
          Issue Type: Sub-task
          Components: Frontend
            Reporter: Gabor Kaszab


When Flink writes equality delete files, the partition columns must also be 
part of the primary key. As a result, the partition values are written into the 
equality delete files. However, the Iceberg spec is more flexible: it is also 
valid for equality delete files to omit the partition values.

To read such tables, Impala should also check that the partition spec and the 
partition values match between the data files and the delete files when applying 
the delete rows. This could be achieved by adding virtual columns and conjuncts 
for the partition spec IDs and for the partition values. These virtual columns 
already exist, but they have to be added to the scan nodes, and the conjuncts 
have to be created for the ANTI JOIN node.
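
To illustrate the intended semantics, here is a minimal Python sketch. It is 
not Impala code. The names Row, anti_join, spec_id and partition are 
hypothetical stand-ins for the virtual columns and the ANTI JOIN node. The 
sketch assumes a table partitioned by "region" whose equality delete file 
contains only "id". It ignores the other applicability rules of the Iceberg 
spec, such as sequence numbers.

```python
from typing import NamedTuple


class Row(NamedTuple):
    spec_id: int      # hypothetical virtual column: partition spec ID of the file
    partition: tuple  # hypothetical virtual column: partition values of the file
    values: dict      # regular columns, column name -> value


def anti_join(data_rows, delete_rows, eq_cols):
    """Keep the data rows that no delete row matches.

    A delete row matches when the partition spec ID, the partition values
    and all equality columns are equal (the extra conjuncts on the join).
    """
    def key(row):
        return (row.spec_id, row.partition,
                tuple(row.values[c] for c in eq_cols))

    deleted = {key(d) for d in delete_rows}
    return [r for r in data_rows if key(r) not in deleted]


data = [
    Row(0, ("eu",), {"id": 1, "region": "eu"}),
    Row(0, ("us",), {"id": 1, "region": "us"}),
]
# The delete file holds only "id"; its partition comes from file metadata.
deletes = [Row(0, ("eu",), {"id": 1})]

survivors = anti_join(data, deletes, eq_cols=["id"])
```

Here only the row from the "us" partition survives. Joining on "id" alone 
would remove both rows, which is why the partition conjuncts are needed once 
the partition values are no longer present in the delete rows.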



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
