Re: GraphX graph partitioning strategy

Larry Xiao Wed, 17 Sep 2014 18:52:55 -0700

Hi Ankur, all,

I've implemented few graph partitioning algorithms, and done someevaluation.The goal is to lower replication factor and produce better balancedgraph, so to make work load more balance.Detailed description and result:https://issues.apache.org/jira/browse/SPARK-3523


Can you help take a look?
Thank you!

Larry

On 7/24/14 2:59 PM, Larry Xiao wrote:

Hi all,
I'm implementing graph partitioning strategy for GraphX, learning fromresearches on graph computing.
I have two questions:

- a specific implement question:
In current design, only vertex ID of src and dst are provided(PartitionStrategy.scala).And some strategies require knowledge about the graph (like degrees)and can consist more than one passes to finally produce the partition ID.So I'm changing the PartitionStrategy.getPartition API to provide moreinfo, but I don't want to make it complex. (the current one looks veryclean)
- an open question:
What advice would you give considering partitioning, considering theprocedure Spark adopt on graph processing?
Any advice is much appreciated.

Best Regards,
Larry Xiao

Reference

Bipartite-oriented Distributed Graph Partitioning for Big Learning.
PowerLyra : Differentiated Graph Computation and Partitioning onSkewed Graphs



---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@spark.apache.org
For additional commands, e-mail: dev-h...@spark.apache.org

Re: GraphX graph partitioning strategy

Reply via email to