Alexander Pivovarov created SPARK-12655:
-------------------------------------------

             Summary: GraphX does not unpersist RDDs
                 Key: SPARK-12655
                 URL: https://issues.apache.org/jira/browse/SPARK-12655
             Project: Spark
          Issue Type: Bug
          Components: GraphX
    Affects Versions: 1.5.2
            Reporter: Alexander Pivovarov


It looks like Graph does not remove all of its RDDs from the cache when unpersist is called.
{code}
// open spark-shell 1.5.2
// run

import org.apache.spark.graphx._

val vert = sc.parallelize(List((1L, 1), (2L, 2), (3L, 3)), 1)
val edges = sc.parallelize(List(Edge[Long](1L, 2L), Edge[Long](1L, 3L)), 1)

val g0 = Graph(vert, edges)
val g = g0.partitionBy(PartitionStrategy.EdgePartition2D, 2)
val cc = g.connectedComponents()

cc.unpersist()
g.unpersist()
g0.unpersist()
vert.unpersist()
edges.unpersist()

// open http://localhost:4040/storage/
// Spark UI 4040 Storage page still shows 2 items

// VertexRDD    Memory Deserialized 1x Replicated       1       100%    1688.0 B        0.0 B   0.0 B
// EdgeRDD      Memory Deserialized 1x Replicated       2       100%    4.7 KB  0.0 B   0.0 B
{code}
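
The leftover entries can probably also be checked from the shell instead of the Storage page. This is a minimal sketch, assuming it is run in the same spark-shell session right after the unpersist calls above. It relies on SparkContext.getPersistentRDDs, which lists the RDDs still marked as persistent; the output formatting is only illustrative, and the list may not match the Storage page exactly.
{code}
// assumes the spark-shell session above, where sc is the shell's SparkContext
// prints one line per RDD that is still marked as persistent
sc.getPersistentRDDs.toSeq.sortBy(_._1).foreach { case (id, rdd) =>
  // rdd.name is null for unnamed RDDs, so wrap it in Option
  val name = Option(rdd.name).getOrElse("(unnamed)")
  println(s"$id\t$name\t${rdd.getStorageLevel.description}")
}
{code}
If unpersist had cleaned up everything, this would be expected to print nothing.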



