erikcw commented on issue #6567:
URL: https://github.com/apache/iceberg/issues/6567#issuecomment-1379415205

   I stumbled into the same issue with a slight twist.  I deleted all the rows 
from my table, however pyiceberg is still returning parquet files with those 
records.  Shouldn't those files no longer be in the current manifest?
   
   ```sql
   -- Executed in Athena
   DELETE FROM iceberg_test WHERE uid = '200441';
   
   select count(uid) from "iceberg_test"
   where uid = '200441';
   
   -- Returns 0.
   
   
   ```
   
   ```python
       # Glue catalog type.
       catalog = load_catalog("default")
       table = catalog.load_table("testing.iceberg_test")
   
       scan = table.scan(
           row_filter=NotEqualTo("uid", "200441"),  # Doesn't seem to make a 
difference with out without this line.
           selected_fields=("uid"),
       )
       files = [task.file.file_path for task in scan.plan_files()]
       # files all contain the deleted value.
      
   
   ```


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to