Iulian Dragos created SPARK-10975:
-------------------------------------

             Summary: Shuffle files left behind on Mesos without dynamic 
allocation
                 Key: SPARK-10975
                 URL: https://issues.apache.org/jira/browse/SPARK-10975
             Project: Spark
          Issue Type: Bug
          Components: Mesos
    Affects Versions: 1.5.1
            Reporter: Iulian Dragos
            Priority: Blocker


(from mailing list)

Running on Mesos in coarse-grained mode. No dynamic allocation or shuffle 
service. 

I see that there are two types of temporary files under /tmp folder associated 
with every executor: /tmp/spark-<UUID> and /tmp/blockmgr-<UUID>. When job is 
finished /tmp/spark-<UUID> is gone, but blockmgr directory is left with all 
gigabytes in it. 

The reason is that logic to clean up files is only enabled when the shuffle 
service is running, see https://github.com/apache/spark/pull/7820

The shuffle files should be placed in the Mesos sandbox or under `tmp/spark` 
unless the shuffle service is enabled.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to