difin commented on code in PR #5540:
URL: https://github.com/apache/hive/pull/5540#discussion_r1932738256


##########
iceberg/iceberg-handler/src/main/java/org/apache/iceberg/mr/hive/compaction/IcebergCompactionUtil.java:
##########
@@ -55,6 +60,12 @@ public static boolean shouldIncludeForCompaction(Table 
table, String partitionPa
             
table.specs().get(file.specId()).partitionToPath(file.partition()).equals(partitionPath);
   }
 
+  public static boolean shouldIncludeForCompaction(Table table, String 
partitionPath, ContentFile<?> file,

Review Comment:
   This method is used for Minor compaction which uses a file size threshold 
clause. It is called from HiveIcebergOutputCommitter for finding data files 
that need to be rewritten at commit time based on file size in bytes without 
checking number of delete files.
   In minor compaction we construct a compaction query with a data file size 
threshold.
   Even Amoro compaction evaluator doesn't check number of delete files of a 
Fragment files, it is based on the Fragment file size.



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to