szehon-ho commented on issue #3582: URL: https://github.com/apache/iceberg/issues/3582#issuecomment-974250575
Yea makes sense, thanks guys for the reply. Look forward to having the driver parallelism in RemoveOrphans, let me know if you have the pr / issue. It may still make sense to introduce a batch mode (the second point) to take advantage of storage-system bulk deletes? HadoopFileSystem seems like it does not have it, but S3 does: (https://docs.aws.amazon.com/AmazonS3/latest/API/API_DeleteObjects.html), can save 1000 HTTP calls. I can take a look to see if it's not too ugly to add. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
