vanzin commented on a change in pull request #24499: [SPARK-25888][Core] Serve 
local disk persisted blocks by the external service after releasing executor by 
dynamic allocation
URL: https://github.com/apache/spark/pull/24499#discussion_r280619556
 
 

 ##########
 File path: core/src/main/scala/org/apache/spark/deploy/worker/Worker.scala
 ##########
 @@ -104,6 +104,15 @@ private[deploy] class Worker(
   private val CLEANUP_NON_SHUFFLE_FILES_ENABLED =
     conf.get(config.STORAGE_CLEANUP_FILES_AFTER_EXECUTOR_EXIT)
 
+  val EXTERNAL_SHUFFLE_SERVICE_ENABLED = 
conf.get(config.SHUFFLE_SERVICE_ENABLED)
+
+  if (CLEANUP_NON_SHUFFLE_FILES_ENABLED && EXTERNAL_SHUFFLE_SERVICE_ENABLED) {
 
 Review comment:
   Are you sure that there isn't other stuff you can delete that are neither 
shuffle nor cached files?
   
   If really there isn't anything else to clean, it seems that now both of 
these config options are basically the same thing.
   
   If there is something, then that should still be done, otherwise you'll be 
using disk space unnecessarily.
   
   (There's a whole other discussion about how to even delete orphaned shuffle 
and cached files after the executor that created them goes away, but that's for 
a separate discussion.)

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[email protected]


With regards,
Apache Git Services

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to