mridulm commented on code in PR #37922:
URL: https://github.com/apache/spark/pull/37922#discussion_r1067551485


##########
common/network-shuffle/src/main/java/org/apache/spark/network/shuffle/ExternalBlockStoreClient.java:
##########
@@ -256,6 +256,22 @@ public void onFailure(Throwable e) {
     }
   }
 
+  @Override
+  public boolean removeShuffleMerge(String host, int port, int shuffleId, int 
shuffleMergeId) {
+    checkInit();
+    try {
+      TransportClient client = clientFactory.createClient(host, port);
+      client.send(
+          new RemoveShuffleMerge(appId, comparableAppAttemptId, shuffleId, 
shuffleMergeId)

Review Comment:
   Failure to remove blocks in `removeBlocks` has an impact on resource 
utilization (for example, block could be in memory). Here, it is more of a best 
effort cleanup - and can fail in context of decommissioned nodes, etc - log 
pollution is something I am concerned with here.
   
   Having said that - the exact same problem exists for 
`removeShuffle.removeShuffleFromShuffleServicesFutures` unfortunately - and 
that is an issue : I would argue for reducing that log message from `warn` to 
`info` (or even `debug` ) in `ExternalBlockStoreClient.removeBlocks` when 
failing to remove shuffle files from ESS.
   
   We can make changes here to log in `info` - and make changes to 
`removeBlocks` for ESS block removal later.
   
   Thoughts ?
   
   +CC @tgravescs who reviewed #35085



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

Reply via email to