Copilot commented on code in PR #3653:
URL: https://github.com/apache/celeborn/pull/3653#discussion_r3044925821


##########
worker/src/main/scala/org/apache/celeborn/service/deploy/worker/PushDataHandler.scala:
##########
@@ -1456,8 +1456,10 @@ class PushDataHandler(val workerSource: WorkerSource) 
extends BaseMessageHandler
          |""".stripMargin)
     val diskFileInfo = fileWriter.getDiskFileInfo
     if (diskFileInfo != null) {
-      if (workerPartitionSplitEnabled && ((diskFull && 
diskFileInfo.getFileLength > partitionSplitMinimumSize) ||
-          (isPrimary && diskFileInfo.getFileLength > 
fileWriter.getSplitThreshold))) {
+      if (workerPartitionSplitEnabled && diskFull && 
(diskFileInfo.getFileLength >= partitionSplitMinimumSize)) {

Review Comment:
   This changes the diskFull split threshold from `> partitionSplitMinimumSize` 
to `>= partitionSplitMinimumSize`. If the intent of this PR is only to force 
HARD_SPLIT (not change the minimum-size boundary), consider keeping the strict 
`>` comparison or add a brief comment/commit message rationale for why equality 
should now trigger a split.
   ```suggestion
         if (workerPartitionSplitEnabled && diskFull && 
(diskFileInfo.getFileLength > partitionSplitMinimumSize)) {
   ```



##########
worker/src/main/scala/org/apache/celeborn/service/deploy/worker/PushDataHandler.scala:
##########
@@ -1456,8 +1456,10 @@ class PushDataHandler(val workerSource: WorkerSource) 
extends BaseMessageHandler
          |""".stripMargin)
     val diskFileInfo = fileWriter.getDiskFileInfo
     if (diskFileInfo != null) {
-      if (workerPartitionSplitEnabled && ((diskFull && 
diskFileInfo.getFileLength > partitionSplitMinimumSize) ||
-          (isPrimary && diskFileInfo.getFileLength > 
fileWriter.getSplitThreshold))) {
+      if (workerPartitionSplitEnabled && diskFull && 
(diskFileInfo.getFileLength >= partitionSplitMinimumSize)) {
+        return StatusCode.HARD_SPLIT
+      }

Review Comment:
   When diskFull triggers a HARD_SPLIT here, the method returns without any 
info/warn log (only the earlier logTrace). In production, trace is often 
disabled, so disk-full driven splits may be hard to diagnose. Consider adding a 
logInfo/logWarning in this branch (include mount point / usable space if 
available) similar to the hard-split log below.



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to