otterc commented on a change in pull request #30433:
URL: https://github.com/apache/spark/pull/30433#discussion_r539871040



##########
File path: 
common/network-shuffle/src/main/java/org/apache/spark/network/shuffle/RemoteBlockPushResolver.java
##########
@@ -586,6 +592,7 @@ public void onData(String streamId, ByteBuffer buf) throws 
IOException {
             deferredBufs = null;
             return;
           }
+          abortIfNecessary();

Review comment:
       Even If the write succeeds, I don't see an issue. If there are no 
further failures with this block it will be merged successfully. If there are 
more failures later, then they will be handled in the same way as now.  I could 
add UTs to verify this as well.
   
   > we still receive data in onData but just not write when the partition 
already reach IO threshold and only call abortIfNecessary inside `onComplete`.
   
   My thought about this is that if a block is completing and doesn't have any 
deferred bufs then we should let it complete without failures if possible. 
   
   Again, I don't have a strong opinion about this. IMO reaching the threshold 
is indicative of a persistent issue and any further writes will faill.




----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]



---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to