rambleraptor commented on code in PR #15154:
URL: https://github.com/apache/iceberg/pull/15154#discussion_r2743551313


##########
spark/v4.1/spark/src/main/java/org/apache/iceberg/spark/actions/ExpireSnapshotsSparkAction.java:
##########
@@ -174,14 +182,71 @@ public Dataset<FileInfo> expireFiles() {
 
       // fetch valid files after expiration
       TableMetadata updatedMetadata = ops.refresh();
-      Dataset<FileInfo> validFileDS = fileDS(updatedMetadata);
 
-      // fetch files referenced by expired snapshots
+      // find IDs of expired snapshots

Review Comment:
   Can you add some comments to break up these code sections? I think it helps 
to understand the flow of the code



##########
spark/v4.1/spark/src/main/java/org/apache/iceberg/spark/actions/BaseSparkAction.java:
##########
@@ -405,7 +409,7 @@ public long totalFilesCount() {
     }
   }
 
-  private static class ReadManifest implements 
FlatMapFunction<ManifestFileBean, FileInfo> {
+  protected static class ReadManifest implements 
FlatMapFunction<ManifestFileBean, FileInfo> {

Review Comment:
   Just making this protected seems fine, but I'd love to get another opinion 
here. 



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to