mridulm commented on code in PR #37638:
URL: https://github.com/apache/spark/pull/37638#discussion_r1050430333


##########
common/network-shuffle/src/main/java/org/apache/spark/network/shuffle/RemoteBlockPushResolver.java:
##########
@@ -1197,15 +1230,15 @@ public void onData(String streamId, ByteBuffer buf) 
throws IOException {
             
appShuffleInfo.shuffles.get(partitionInfo.appAttemptShuffleMergeId.shuffleId);
         if (isStale(info, 
partitionInfo.appAttemptShuffleMergeId.shuffleMergeId) ||

Review Comment:
   I would look at these metrics as a means for identifying the effectiveness 
of push based shuffle, what is impacting the inefficiencies (how many stale 
blocks, how many late blocks, how that distribution is changing with time), etc.
   These will correlate in some way to overall impact on when push based 
shuffle was used, when normal shuffle was used, and what the network and task 
impact of those are.
   
   Do let me know if you have thoughts !



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to