[ https://issues.apache.org/jira/browse/SPARK-13687?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15227482#comment-15227482 ]
holdenk commented on SPARK-13687:
---------------------------------
I'll take this one :)
> Cleanup pyspark temporary files
> -------------------------------
>
> Key: SPARK-13687
> URL: https://issues.apache.org/jira/browse/SPARK-13687
> Project: Spark
> Issue Type: Improvement
> Components: PySpark
> Affects Versions: 1.5.2, 1.6.0
> Reporter: Damir
>
> Every time parallelize is called, it creates a temporary file for the RDD in the
> spark.local.dir/spark-uuid/pyspark-uuid/ directory. This directory is deleted
> when the context is closed, but for long-running applications with a permanently
> open context, the directory grows without bound.
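
The growth described above can be observed without touching Spark itself. The
following is a minimal sketch, not part of the issue or of PySpark: dir_usage is
a hypothetical helper name, and the path passed to it is assumed to be whatever
spark.local.dir resolves to on the driver. It only measures the directory and
deletes nothing.

```python
import os


def dir_usage(root):
    """Return (file_count, total_bytes) for all regular files under root.

    Hypothetical helper for illustration; 'root' is assumed to be the
    directory that spark.local.dir points at (or a subdirectory of it).
    """
    count = 0
    total = 0
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            path = os.path.join(dirpath, name)
            try:
                total += os.path.getsize(path)
            except OSError:
                # The file disappeared between listing and stat; skip it.
                continue
            count += 1
    return count, total
```

Calling it periodically from a long-running driver and logging the result
would show whether the pyspark-uuid directory keeps growing between
parallelize calls.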