szehon-ho commented on issue #5058: URL: https://github.com/apache/iceberg/issues/5058#issuecomment-1177003302
I think the way to do this is to write a Spark job that finds 'dangling' delete files (that is, delete files that don't point to any live data file). I think once this pr is in: https://github.com/apache/iceberg/pull/4812, we can implement the new Spark action 'removeDanglingDeleteFile'. I think this was in the original delete file design doc https://docs.google.com/document/d/1-EyKSfwd_W9iI5jrzAvomVw3w1mb_kayVNT7f2I-SUg/edit#heading=h.fxypqdd7zxcj -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
