slfan1989 opened a new pull request, #669: URL: https://github.com/apache/iceberg-cpp/pull/669
## Summary

This PR adds parallel file deletion support to `ExpireSnapshots` cleanup. The change is scoped to the internal cleanup path only:

- Keeps the public `ExpireSnapshots` API unchanged.
- Uses an internal default delete parallelism.
- Splits cleanup files into chunks and deletes them concurrently.
- Preserves the existing best-effort cleanup behavior.
- Keeps `DeleteWith(...)` supported for custom deletion logic.

## Motivation

`ExpireSnapshots` may need to delete many files during cleanup, including data files, manifest files, manifest list files, and statistics files. Deleting these files sequentially can slow down snapshot expiration for large tables. Parallelizing the cleanup deletion path improves throughput without changing the default behavior of `FileIO::DeleteFiles`.
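For readers who want a concrete picture of the chunk-and-delete approach described in the Summary, here is a minimal sketch using only the C++ standard library. It is not the code from this PR: `DeleteInChunks`, `kAssumedDeleteParallelism`, and the `delete_fn` callback signature are hypothetical names chosen for illustration, and the value `4` is an assumption, since the description does not state the actual internal default. A failed or throwing delete is counted and skipped, which is one way to keep cleanup best-effort.

```cpp
#include <algorithm>
#include <cstddef>
#include <functional>
#include <future>
#include <string>
#include <vector>

// Assumed value for illustration; the PR does not state its internal default.
constexpr std::size_t kAssumedDeleteParallelism = 4;

// Hypothetical helper, not the iceberg-cpp implementation.
// Splits `paths` into at most `parallelism` contiguous chunks and deletes
// each chunk on its own asynchronous task. `delete_fn` returns true on
// success and must be safe to call from several threads at once.
// Returns the number of files that could not be deleted.
std::size_t DeleteInChunks(
    const std::vector<std::string>& paths,
    const std::function<bool(const std::string&)>& delete_fn,
    std::size_t parallelism = kAssumedDeleteParallelism) {
  if (paths.empty()) return 0;

  // Clamp to [1, paths.size()] so every task gets at least one file.
  parallelism = std::max<std::size_t>(1, std::min(parallelism, paths.size()));
  const std::size_t chunk_size = (paths.size() + parallelism - 1) / parallelism;

  std::vector<std::future<std::size_t>> tasks;
  for (std::size_t begin = 0; begin < paths.size(); begin += chunk_size) {
    const std::size_t end = std::min(begin + chunk_size, paths.size());
    tasks.push_back(std::async(
        std::launch::async, [&paths, &delete_fn, begin, end] {
          std::size_t failures = 0;
          for (std::size_t i = begin; i < end; ++i) {
            try {
              if (!delete_fn(paths[i])) ++failures;
            } catch (...) {
              // Best effort: record the failure and continue with the chunk.
              ++failures;
            }
          }
          return failures;
        }));
  }

  // Wait for every chunk and sum the per-chunk failure counts.
  std::size_t total_failures = 0;
  for (auto& task : tasks) total_failures += task.get();
  return total_failures;
}
```

Taking the delete operation as a callback loosely mirrors how a custom deleter supplied through `DeleteWith(...)` could be driven by the same chunking logic, though how the PR actually wires this up is not shown in the description.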
