Patrick Liu created SPARK-3875:
----------------------------------
Summary: Add TEMP DIRECTORY configuration
Key: SPARK-3875
URL: https://issues.apache.org/jira/browse/SPARK-3875
Project: Spark
Issue Type: Improvement
Components: Spark Core
Affects Versions: 1.1.0
Reporter: Patrick Liu
Currently, the Spark uses "java.io.tmpdir" to find the /tmp/ directory.
Then, the /tmp/ directory is used to
1. Setup the HTTP File Server
2. Broadcast directory
3. Fetch Dependency files or jars by Executors
The size of the /tmp/ directory will keep growing. The free space of the system
disk will be less.
I think we could add a configuration "spark.tmp.dir" in conf/spark-env.sh or
conf/spark-defaults.conf to set this particular directory. Let's say, set the
directory to a data disk.
If "spark.tmp.dir" is not set, use the default "java.io.tmpdir"
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]