krisnaru opened a new pull request, #14354:
URL: https://github.com/apache/iceberg/pull/14354

   ### Description:
   Add logging and metrics instrumentation to the Iceberg expire snapshots 
functionality to help diagnose and prevent production issues where the 
expire snapshots job deletes more files than expected.
   
   ### Background:
   We encountered a production issue where an expire snapshots job deleted 
significantly more files than expected, making the table unreadable and causing 
queries to fail with `java.io.FileNotFoundException: File does not exist`.
   
   ### Requirements:
   - Log the list of files deleted during snapshot expiration.
   - Log the reason each data file was marked for deletion.
   - Provide enough instrumentation to trace the root cause of unexpected
     file deletions.
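
The logging requirements above could be approached with a wrapper around the delete callback. The following is a minimal sketch, not the code in this PR: `AuditingDeleter` is a hypothetical class, the single fixed `reason` string is a placeholder assumption, and `java.util.logging` is used only to keep the example self-contained.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;
import java.util.function.Consumer;
import java.util.logging.Logger;

/** Hypothetical helper (not part of Iceberg): logs and records every path passed to a delete callback. */
final class AuditingDeleter implements Consumer<String> {
  private static final Logger LOG = Logger.getLogger(AuditingDeleter.class.getName());

  private final Consumer<String> delegate;  // performs the actual deletion
  private final String reason;              // assumed: one reason per deleter instance
  private final List<String> deleted = new ArrayList<>();

  AuditingDeleter(Consumer<String> delegate, String reason) {
    this.delegate = delegate;
    this.reason = reason;
  }

  @Override
  public synchronized void accept(String path) {
    // Log before deleting so a record exists even if the delete fails.
    LOG.info(() -> String.format("Deleting file %s (reason: %s)", path, reason));
    delegate.accept(path);
    // Record the path only after the delegate returned without throwing.
    deleted.add(path);
  }

  /** Returns a read-only snapshot of the paths deleted so far. */
  synchronized List<String> deletedPaths() {
    return Collections.unmodifiableList(new ArrayList<>(deleted));
  }
}
```

Iceberg's `ExpireSnapshots.deleteWith(...)` accepts a `Consumer<String>`, so a wrapper of this shape could be passed there. That callback receives only the file path, so a true per-file reason would have to be produced inside the expiration logic itself.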
   
   ### Metrics:
   - Track the number of files deleted.
   - Monitor snapshot expiration operations.
   - Alert on anomalies in deletion patterns.
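
As an illustration of the metrics listed above, here is a minimal sketch of a deletion counter with a simple threshold check. `ExpirationMetrics`, its method names, and the fixed-threshold anomaly rule are all assumptions made for this example. They do not come from this PR or from Iceberg's metrics API.

```java
import java.util.concurrent.atomic.AtomicLong;

/** Hypothetical metrics holder: counts deleted files and flags runs that exceed an expected maximum. */
final class ExpirationMetrics {
  private final AtomicLong deletedFiles = new AtomicLong();
  private final long expectedMaxDeletes;  // assumed: threshold chosen by the operator

  ExpirationMetrics(long expectedMaxDeletes) {
    if (expectedMaxDeletes < 0) {
      throw new IllegalArgumentException("expectedMaxDeletes must be non-negative");
    }
    this.expectedMaxDeletes = expectedMaxDeletes;
  }

  /** Call once per file deleted; safe to call from multiple threads. */
  void recordDelete() {
    deletedFiles.incrementAndGet();
  }

  long deletedFileCount() {
    return deletedFiles.get();
  }

  /** True once more files have been deleted than the configured maximum. */
  boolean isAnomalous() {
    return deletedFiles.get() > expectedMaxDeletes;
  }
}
```

A job could check `isAnomalous()` after the expiration run and raise an alert. A stricter variant might check before each delete and abort, at the cost of leaving the expiration partially applied.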


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

