[
https://issues.apache.org/jira/browse/GIRAPH-26?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sean Choi updated GIRAPH-26:
----------------------------
Attachment: GIRAPH-26-1.patch
Adding a more realistic graph generator based on this paper
cs.stanford.edu/~jure/pubs/kronFit-icml07.pdf
The input takes two arguments
-K which represents the number of Kronecker power
-M which represents the initial matrix in matlab notation
for example the arguments
-K 10 -M "0.9 0.5; 0.5 0.1;"
will give a matrix of size 2^10 with randomly generated edges with power law
distributions and all the goodies...
> Improve PseudoRandomVertexInputFormat to create a more realistic synthetic
> graph (e.g. power-law distributed vertex-cardinality).
> ---------------------------------------------------------------------------------------------------------------------------------
>
> Key: GIRAPH-26
> URL: https://issues.apache.org/jira/browse/GIRAPH-26
> Project: Giraph
> Issue Type: Test
> Components: benchmark
> Reporter: Jake Mannix
> Assignee: Jake Mannix
> Priority: Minor
> Attachments: GIRAPH-26-1.patch
>
>
> The PageRankBenchmark class, to be a proper benchmark, should run over graphs
> which look more like data seen in the wild, and web link graphs, social
> network graphs, and text corpora (represented as a bipartite graph) all have
> power-law distributions, so benchmarking a synthetic graph which looks more
> like this would be a nice test which would stress cases of uneven
> split-distribution and bottlenecks of subclusters of the graph of heavily
> connected vertices.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators:
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira