Not that updated I guess, but I have some presentations that are kind of recent (but still prior to the vertex range changes in GIRAPH-11 unfortunately). Here is my most recent one from October 2011 (http://www.slideshare.net/averyching/20111014hortonworks). There are some folks working on presentations for FOSDEM (Claudio) and Berlin Buzzwords (Jakob). Maybe they have some up-to-date material?

Avery


On 1/22/12 7:23 PM, Deepak Nettem wrote:
Awesome!

I can't wait to try this out. Is there any other resources / updated documentation where I can get insight into how Giraph works internally?

I am specifically interested in understanding how Mapper-only makes this entire thing possible, what are the key-value pairs (since it's built on top of Hadoop), and where Zookeeper fits in.

Best,
Deepak

On Sun, Jan 22, 2012 at 5:30 PM, Avery Ching <ach...@apache.org <mailto:ach...@apache.org>> wrote:

    Hi Deepak,

    Answers inline.

    Happy sunday!

    Avery


    On 1/22/12 11:30 AM, Deepak Nettem wrote:
    Hi Folks,

    I have a Graph processing problem where after each iteration,
    some vertices get vanished. That is, they get merged into their
    neighbouring nodes based on certain conditions and the graph
    keeps getting simplified after each iteration.

    I was wondering if Giraph is worth trying for this.

    Giraph can support this graph mutation at any iteration in the
    graph.  See
    
https://svn.apache.org/repos/asf/incubator/giraph/trunk/src/main/java/org/apache/giraph/graph/MutableVertex.java,
    the method r emoveVertexRequest() will remove vertices for you
    between iterations.


    I was going through the documentation and it says that the input
    data has to be sorted.Why is this necessary?


    Ouch, this used to be a requirement, but no longer.  You can load
    vertices however you like.  The workers will forward them to the
    appropriate partition.


    Also, how does the  so called 'master' divide vertices in ranges?
    Does it use some kind of a range partitioner? If there is range
    partitioning, that's a problem for me, because of vanishing
    vertices - there will be load imbalance.

    Again, out of date documentation.  Please see
    https://issues.apache.org/jira/browse/GIRAPH-11 for the relevant
    change.  Let us know if you have any other questions.

    Best,
    Deepak


Reply via email to