Take a look at this one: https://issues.apache.org/jira/browse/SPARK-1188
It was an optimization that made things inconvenient for users. We got rid of it in Spark 1.0.

On Wed, May 28, 2014 at 11:48 PM, Michael Malak <michaelma...@yahoo.com> wrote:

> Shouldn't I be seeing N2 and N4 in the output below? (Spark 0.9.0 REPL) Or
> am I missing something fundamental?
>
> val nodes = sc.parallelize(Array((1L, "N1"), (2L, "N2"), (3L, "N3"), (4L, "N4"), (5L, "N5")))
> val edges = sc.parallelize(Array(Edge(1L, 2L, "E1"), Edge(1L, 3L, "E2"), Edge(2L, 4L, "E3"), Edge(3L, 5L, "E4")))
> Graph(nodes, edges).triplets.collect
> res1: Array[org.apache.spark.graphx.EdgeTriplet[String,String]] =
> Array(((1,N1),(3,N3),E2), ((1,N1),(3,N3),E2), ((3,N3),(5,N5),E4),
> ((3,N3),(5,N5),E4))
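
For anyone wondering why each edge shows up twice in that output, here is a minimal sketch in plain Scala with no Spark involved. It assumes the optimization in question is the triplet iterator reusing a single mutable object across calls to next(), which is my reading of the JIRA ticket. The `Triplet` class, `ReuseDemo` object, and `reusing` method are hypothetical names made up for illustration. They are not GraphX classes.

```scala
// Hypothetical stand-in for an edge triplet; NOT the GraphX EdgeTriplet class.
final class Triplet(var src: String, var dst: String, var attr: String) {
  def copy(): Triplet = new Triplet(src, dst, attr)
  override def toString: String = s"($src,$dst,$attr)"
}

object ReuseDemo {
  // Two edges that live in the same (imaginary) partition.
  val edges = Seq(("N1", "N2", "E1"), ("N1", "N3", "E2"))

  // Iterator that overwrites and returns one shared object on every step,
  // which saves allocations but hands out the same reference each time.
  def reusing: Iterator[Triplet] = {
    val shared = new Triplet("", "", "")
    edges.iterator.map { case (s, d, a) =>
      shared.src = s; shared.dst = d; shared.attr = a
      shared
    }
  }

  def main(args: Array[String]): Unit = {
    // Buffering the references stores the same object twice, so both
    // entries show whichever edge was written last.
    println(reusing.toArray.mkString(", "))   // (N1,N3,E2), (N1,N3,E2)

    // Copying each element before buffering keeps every edge distinct.
    println(reusing.map(_.copy()).toArray.mkString(", "))   // (N1,N2,E1), (N1,N3,E2)
  }
}
```

If that assumption holds, something similar should help on 0.9.x: pull the fields out into an immutable value before collecting, e.g. `Graph(nodes, edges).triplets.map(t => (t.srcId, t.srcAttr, t.dstId, t.dstAttr, t.attr)).collect`. I have not run this against 0.9.0, so treat it as a suggestion rather than a confirmed fix.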