vincentpoon opened a new issue, #5936:
URL: https://github.com/apache/iceberg/issues/5936

   ### Apache Iceberg version
   
   0.14.0
   
   ### Query engine
   
   Trino
   
   ### Please describe the bug 🐞
   
   With a simple table partitioned by one identity column, e.g. `partCol int`
   Insert 2 rows, where partCol=1 and partCol=2
   Now DELETE where partCol=2
   
   The manifest list will show that the new manifest  partition stats for 
`partCol` are lower=1 upper=2
   That is, it incorporates the stats for all files in the manifest, including 
those with `status: 2` (i.e. deleted file)
   
   This has query perf implications in that a query `SELECT * FROM table WHERE 
partCol=2` must still read the manifest.
   
   It is only after the manifest is further rewritten, when the `status: 2` 
file is evicted, that the column stats are accurately reflected.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to