aokolnychyi commented on code in PR #4503:
URL: https://github.com/apache/iceberg/pull/4503#discussion_r846544116
##########
api/src/main/java/org/apache/iceberg/actions/DeleteOrphanFiles.java:
##########
@@ -80,6 +80,19 @@
*/
DeleteOrphanFiles executeDeleteWith(ExecutorService executorService);
+ /**
+ * Passes a table which contains the list of actual files in the table. This skips the directory listing - any
+ * files in the actualFilesTable provided which are not found in table metadata will be deleted. Not compatible
+ * with `location` or `older_than` arguments - this assumes that the provided table of actual files has been
+ * filtered down to the table’s location and only includes files older than a reasonable retention interval.
+ *
+ * @param tableName the table containing the actual files dataset. Should have a single `file_path` string column
+ * @return this for method chaining
+ */
+ default DeleteOrphanFiles actualFilesTable(String tableName) {
Review Comment:
I guess the problem is that the Action API is engine agnostic so we cannot
refer to `Dataset`. We considered offering engine-specific actions at some
point but that idea was discarded as it would overcomplicate everything.
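
To illustrate the constraint, here is a minimal sketch of how an engine-agnostic interface can accept a table name and leave resolution to the engine-side implementation. Every name in it (`OrphanFileAction`, `InMemoryOrphanFileAction`, the `Map`-based catalog) is hypothetical and only stands in for the real classes; it is not the code from this PR and does not show how Iceberg or Spark actually implement the action.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.HashSet;
import java.util.List;
import java.util.Map;
import java.util.Set;

// Hypothetical stand-in for an engine-agnostic action interface (not Iceberg's real API).
interface OrphanFileAction {
  // Engine-neutral signature: callers pass a table *name*, never an engine type such as Dataset.
  // Implementations that cannot resolve a name keep this default and reject the call.
  default OrphanFileAction actualFilesTable(String tableName) {
    throw new UnsupportedOperationException(
        getClass().getName() + " does not support actualFilesTable");
  }

  // Returns the paths considered orphaned.
  List<String> execute();
}

// Hypothetical engine-side implementation. The Map plays the role of an engine catalog
// that maps a table name to the values of its single file_path column.
class InMemoryOrphanFileAction implements OrphanFileAction {
  private final Map<String, List<String>> catalog;
  private final Set<String> referencedFiles;
  private List<String> actualFiles = new ArrayList<>();

  InMemoryOrphanFileAction(Map<String, List<String>> catalog, Set<String> referencedFiles) {
    this.catalog = new HashMap<>(catalog);
    this.referencedFiles = new HashSet<>(referencedFiles);
  }

  @Override
  public OrphanFileAction actualFilesTable(String tableName) {
    // Only the engine-side code knows how to turn the name into rows.
    List<String> files = catalog.get(tableName);
    if (files == null) {
      throw new IllegalArgumentException("Unknown table: " + tableName);
    }
    this.actualFiles = new ArrayList<>(files);
    return this;
  }

  @Override
  public List<String> execute() {
    // A file is an orphan if it is listed as present but not referenced by table metadata.
    List<String> orphans = new ArrayList<>();
    for (String path : actualFiles) {
      if (!referencedFiles.contains(path)) {
        orphans.add(path);
      }
    }
    return orphans;
  }
}
```

With this shape the API module depends only on `String`, and each engine decides how (or whether) to resolve the name, at the cost of callers having to register their dataset under a name first.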
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]