Lantao Jin created SPARK-30494:
----------------------------------

             Summary: Duplicates cached RDD when create or replace an existing view
                 Key: SPARK-30494
                 URL: https://issues.apache.org/jira/browse/SPARK-30494
             Project: Spark
          Issue Type: Bug
          Components: SQL
    Affects Versions: 3.0.0
            Reporter: Lantao Jin


The issue can be reproduced with the following commands:
{code}
beeline> create or replace temporary view temp1 as select 1
beeline> cache table temp1
beeline> create or replace temporary view temp1 as select 1, 2
beeline> cache table temp1
{code}

The cached RDD for the plan "select 1" stays in memory until the session
closes. This cached data can no longer be used because the view temp1 has been
replaced by another plan, so it is a memory leak.

assert(spark.sharedState.cacheManager.lookupCachedData(sql("select 1, 2")).isDefined)
assert(spark.sharedState.cacheManager.lookupCachedData(sql("select 1")).isDefined)
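
The leak described above can be illustrated with a small toy model. This is only a sketch and does not use Spark: ViewCacheSketch, PlanCache, Session and the dropStaleOnReplace flag are hypothetical names invented for illustration, and a plain string stands in for a query plan. It assumes the cache is keyed by plan rather than by view name, which is what the two assertions above suggest.

```scala
import scala.collection.mutable

// Toy model only: this is NOT Spark's CacheManager. All names are hypothetical.
object ViewCacheSketch {

  // Cached data keyed by query plan (a string stands in for a logical plan).
  final class PlanCache {
    private val entries = mutable.Set.empty[String]
    def cache(plan: String): Unit = { entries += plan }
    def uncache(plan: String): Unit = { entries -= plan }
    def isCached(plan: String): Boolean = entries.contains(plan)
    def size: Int = entries.size
  }

  // A session holding temporary views (name -> plan) and a plan cache.
  final class Session(dropStaleOnReplace: Boolean) {
    val cache = new PlanCache
    private val views = mutable.Map.empty[String, String]

    def createOrReplaceTempView(name: String, plan: String): Unit = {
      // Possible remedy (an assumption, not necessarily Spark's actual fix):
      // when the plan behind an existing view changes, drop the old cached plan.
      if (dropStaleOnReplace) {
        views.get(name).filter(_ != plan).foreach(old => cache.uncache(old))
      }
      views(name) = plan
    }

    // Caches whatever plan the view currently points to.
    def cacheTable(name: String): Unit = cache.cache(views(name))
  }

  // Replays the four commands from the report against the toy session.
  def run(dropStaleOnReplace: Boolean): Session = {
    val s = new Session(dropStaleOnReplace)
    s.createOrReplaceTempView("temp1", "select 1")
    s.cacheTable("temp1")
    s.createOrReplaceTempView("temp1", "select 1, 2")
    s.cacheTable("temp1")
    s
  }
}
```

With dropStaleOnReplace = false, the entry for "select 1" stays in the toy cache even though no view refers to it any more, mirroring the reported behavior. With the flag set to true, only the entry for "select 1, 2" remains.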


