JindongZhang created SPARK-35751:
------------------------------------

             Summary: Support Joint eviction strategies for cached RDD 
partitions
                 Key: SPARK-35751
                 URL: https://issues.apache.org/jira/browse/SPARK-35751
             Project: Spark
          Issue Type: Improvement
          Components: Block Manager
    Affects Versions: 3.1.2
            Reporter: JindongZhang


As of now, there's as it were ousting procedure for cached RDD segment in 
Spark. The default RDD removal methodology is LRU .When memory space not 
adequate for RDD caching, a few allotments will be ousted, on the off chance 
that these segments are utilized once more latterly, they will be replicated by 
the Ancestry data and cached in memory once more. The replicate stage will 
bring in extra taken a toll. Be that as it may, LRU has no ensure for the most 
reduced duplicate cost. The to begin with RDD that required to be cached is 
ordinarily created by perusing from HDFS and doing a few changes. The perusing 
operation ordinarily fetched longer time than other Start transformations. 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to