On Wed, Feb 3, 2010 at 4:41 PM, Edward J. Yoon <[email protected]> wrote:
>>> I think the needToVisit() function might as well need to communicate
>>> with other machine:
>
> Hmm, You're exactly right. In that example, needToVisit() function
> checks the IsVisited from some shared-space (e.g., HBase or DBMS, ...,
> etc). We wrote with intent to simplify it.

>From the pseudocode, I see that every Vertex will request IsVisited.
In a large graph, the HBase or DBMS will be overwhelmed by many tiny
requests from each Vertex.
Does the needToVisit() has "bulk query" that aggregates the tiny
requests into a single request?


> Integer is the distance at "Map<Vertex, Integer> input, Map<Vertex,
> Integer> nextQueue", but it could be replaced as other object. for
> example, new PathWeHaveFollowed(). Then, perhaps we need not some
> shared-space.

But we also need to know PathOthersHaveFollowed() don't we?
That's why we need HBase to store the global "visited" states of each node.

Felix Halim

Reply via email to