Hi,
Vertices are simply hash-partitioned by org.apache.spark.HashPartitioner, so
you can easily calculate the partition IDs yourself.
You can also run the following lines to check the IDs:
import org.apache.spark.graphx._
graph.vertices.mapPartitionsWithIndex { (pid, iter) =>
val vids = Array.newBuilder[VertexId]
for
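
The snippet above is cut off in the archive. As a rough, self-contained sketch of the same idea (not the original poster's exact code), the program below lists the vertex IDs found in each partition and then compares them with the partition that HashPartitioner predicts. The object name PartitionCheck, the local[2] master, and the four-vertex sample graph (IDs 1 to 4 standing in for a, b, c, d) are assumptions made for illustration only.

```scala
import org.apache.spark.{HashPartitioner, SparkConf, SparkContext}
import org.apache.spark.graphx._

// Hypothetical standalone app; the name and the local master are illustrative.
object PartitionCheck {
  def main(args: Array[String]): Unit = {
    val conf = new SparkConf().setAppName("PartitionCheck").setMaster("local[2]")
    val sc = new SparkContext(conf)

    // Assumed sample data: four vertices (a, b, c, d) in two partitions.
    val vertices = sc.parallelize(Seq((1L, "a"), (2L, "b"), (3L, "c"), (4L, "d")), 2)
    val edges = sc.parallelize(Seq(Edge(1L, 2L, 1), Edge(2L, 3L, 1), Edge(3L, 4L, 1)), 2)
    val graph = Graph(vertices, edges)

    // Observed placement: collect the vertex IDs held by each partition.
    val observed = graph.vertices.mapPartitionsWithIndex { (pid, iter) =>
      Iterator((pid, iter.map(_._1).toArray))
    }.collect()
    observed.foreach { case (pid, vids) =>
      val joined = vids.mkString(",")
      println(s"partition $pid: $joined")
    }

    // Predicted placement: ask the partitioner where each vertex ID belongs.
    // Falls back to a HashPartitioner of the same size if none is set.
    val partitioner = graph.vertices.partitioner
      .getOrElse(new HashPartitioner(graph.vertices.partitions.length))
    graph.vertices.map(_._1).collect().sorted.foreach { vid =>
      val predicted = partitioner.getPartition(vid)
      println(s"vertex $vid -> predicted partition $predicted")
    }

    sc.stop()
  }
}
```

If the two listings agree, the vertices are indeed laid out by hash of the vertex ID. In spark-shell, where sc already exists, the body of main can be pasted without the SparkConf and SparkContext lines.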
Thanks for all the responses so far! I have started to understand the
system more, but I just had another question while I was going along. Is
there a way to check the individual partitions of an RDD? For example, if I
had a graph with vertices a, b, c, d and it was split into 2 partitions, could
I
or something?
Thank you very much for your insight,
Matthew Bucci
--
View this message in context:
http://apache-spark-user-list.1001560.n3.nabble.com/Learning-GraphX-Questions-tp21649.html
At 2015-02-13 12:19:46 -0800, Matthew Bucci mrbucci...@gmail.com wrote:
1) How do you actually run programs in GraphX? At the moment I've been doing
everything live through the shell, but I'd obviously like to be able to work
on it by writing and running scripts.
You can create your own