Josh Rosen created SPARK-7689:
---------------------------------

             Summary: Deprecate spark.cleaner.ttl
                 Key: SPARK-7689
                 URL: https://issues.apache.org/jira/browse/SPARK-7689
             Project: Spark
          Issue Type: Improvement
            Reporter: Josh Rosen


With the introduction of ContextCleaner, I think there's no longer any reason 
for most users to enable the MetadataCleaner / {{spark.cleaner.ttl}} (except 
perhaps for super-long-lived Spark REPLs where you're worried about orphaning 
RDDs or broadcast variables in your REPL history and having them never get 
cleaned up, although I think this is an uncommon use-case).  I think that this 
property used to be relevant for Spark Streaming jobs, but I think that's no 
longer the case since the latest Streaming docs have removed all mentions of 
{{spark.cleaner.ttl}} (see 
https://github.com/apache/spark/pull/4956/files#diff-dbee746abf610b52d8a7cb65bf9ea765L1817,
 for example).

See 
http://apache-spark-user-list.1001560.n3.nabble.com/is-spark-cleaner-ttl-safe-td2557.html
 for an old, related discussion.  Also, see 
https://github.com/apache/spark/pull/126, the PR that introduced the new 
ContextCleaner mechanism.

We should probably add a deprecation warning to {{spark.cleaner.ttl}} that 
advises users against using it, since it's an unsafe configuration option that 
can lead to confusing behavior if it's misused.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to