Re: [P2PSIP] Section 9 of RELOAD - Comments

glitch Mon, 11 May 2009 15:42:49 -0700

The RELOAD routing is fundamentally flawed and for that reason I amtossing the Chord implementation that it defines and replacing"periodic stabilization" with "correction on use and correct onchange" also I am ditching finger tables all together as they aren'tneeded whatsoever with these changes. Also this brings globalbroadcast and multicast that can be registered via the hash tablewhich are very useful for ICE and any other signaling with littleoverhead.

I guess I will have to write my own RFC since RELOAD is IMHO usingoutdated overlay technology and since having said that shouldn't it bea flawless design and not broken right now? I think so.

@Narayan You could not be more wrong about "the number of fingers anode has should provide a rough estimate (with accuracy in the orderof log N) of the size of the overlay". People use this formula inKadmelia, Chord, Chimera, etc and it does not work whatsoever and youraccuracy is even wrong too. I use the algorithm described here: http://dks.sics.se/pub/netsize.pdf


-g

On May 11, 2009, at 12:47 PM, Narayanan, Vidya wrote:

I've described a number of comments on section 9 below. But, if theresponses can be separated out under each topic, it will help thediscussion quite a bit, given the length of this email.
Routing (Section 9.3):
----------------------
The routing rule as specified does not work in all cases. If a nodewith ID N and immediate successor S receives a message for resourceid k such that N < k < S, since k is neither a node directlyattached to N and nor is there a node whose ID is between N and k,the message cannot be routed. The routing rule should instead readas follows:
"If a peer is not responsible for a Resource-ID k, but is directlyconnected to a node with Node-ID k, then it routes the message tothat node. Otherwise, it routes the request to the peer in therouting table that has the largest Node-ID that is in the intervalbetween the peer and k. **If no such node is found, it finds thesmallest node id that is greater than k and routes the message tothat node.** The routing table is the union of the neighbor tableand the finger table."
The new text is as enclosed in "**  **" above.

Redundancy (Section 9.4):
-------------------------
On the open issue, there should be no need to wait for successfulredundant copy storage to return a STORE response. It doesn't makesense to send a failure if one of the redundant storage attemptsfails anyway - so, the wait becomes a pointless exercise.
  "Note that a malicious node can return a success response but not
  store the data locally or in the replica set.  Requesting peers that
  wish to ensure that the replication actually occurred SHOULD [[Open
  Issue:  SHOULD or MAY?]] contact each peer listed in the replicas
  field of the Store response and retrieve a copy of the data."
The above text is not just relevant for redundant copies of datastored - it is true for even the actual resource-id owner. Also, inlight of not waiting for the replica responses, the STORE responsewill not contain a list of replicas - so, this needs to be corrected.
Joining (Section 9.5):
----------------------
It is not clear why the join procedure requires all neighbor andfinger connections to be established prior to sending the JOINmessage itself. To be functional in the overlay, only the immediatesuccessor relationships must be correctly in place. The othersuccessors, predecessors and fingers can be establishedsubsequently. This also doesn't cause nodes to establishunnecessary connections unless the node has successfully joined.There is also the question of whether a JP is supposed to establisha TLS connection to the BP to send messages through it, which ispresently not addressed in the draft. It seems to me that a morestreamlined join process can be defined as follows:
1. JP connects to its chosen bootstrap node. JP MUST form a TLSsession with the bootstrap node, using the certificate or PSKrequired for overlay membership.2. JP sends a Join request message to its own Node-ID n, throughthe BP. This is routed to the admitting peer (AP). The AP sends aresponse to the Join.3. JP also sends an Attach request message to n through the BP.This may be sent in parallel with the Join message. The AP sends anattach response, resulting in connection establishment between theJP and AP.4. AP does a series of Store requests to JP to store the data thatJP will be responsible for.5. AP sends JP an Update explicitly labeling JP as itspredecessor. AP also sends an Update to all its other neighborswith the new value of its neighbor set (including JP). At thispoint, JP is part of the ring and responsible for a section of theoverlay. AP SHOULD now store any data that JP is responsible for aspart of replicas it will store (since AP is still JP's successor, itwill need to store the data as well anyway).6. JP sends Attach requests to the neighbors and establishesconnections with the required successors and predecessors.7. JP establishes finger connections by sending Attach requests topeers (n+2^i). (Comments on finger selection below).
Updates (Section 9.7):
----------------------
Section 9.7.1. states that every time a connection is lost, the peershould remove it from its neighbor table and find a different peer.Section 9.7.3. talks about attempting to re-establish connectionswhen a neighbor is detected to be invalid. These are at odds. Itseems to make sense that the peer would always try to re-establishconnections and only remove that neighbor if the connection cannotbe re-established.
Further, having the enrollment server involved in specifying thefrequency of updates and probes seems strange. How is theenrollment server supposed to know the magic values here? It seemslike it should be enough to specify a recommended value and leave itat that. Since the finger stabilization is for optimization and theneighbor stabilization is reactive anyway in the present draft,having these parameters in the config document seem to be of littleuse.
Finger entries (Section 9.7.3 and elsewhere):
---------------------------------------------
The draft requires 16 finger table entries - at first, I took thoseto mean 16 different resource ids that will be probed per formula.After reading 9.7.3, it appears that it is talking about 16different fingers that a node should maintain. In a small overlay,that's ridiculously large. The text is also unclear on what happenswhen the overlay shrinks. Further, the text states that a peerSHOULD consider the finger table entry valid if it is in the range[n + 2^(numBitsInNodeId-i), n + 1.5 x 2^(numBitsInNodeId-i)]. Forsmall overlays, there may be no entries in this range for every i in[0, 127].
A node should only need log N fingers to route in OlogN hops. Whatwe need to specify is an algorithm that uses the same pattern inresource ids to probe - the fingers corresponding to multiple ofthose will collapse to the same node in a small overlay. In ourimplementation, the algorithm works as follows:
A node looks at the list of resource-ids corresponding to f=(n+2^i),for all values of i=[0,127]. Let's say that it first probes n+2^0and obtains a finger of f0. It checks f0 against the next set ofresource-ids to probe and determines the next larger resource-id toprobe from the list. It populates the finger table entries for allother fingers f for which n < f < f0, with f0 as the correspondingfinger. Depending on the number of nodes in the overlay, thisshould result in approximately logN fingers in the finger table atany time. Periodic stabilization should adjust the number offingers automatically. In fact, the number of fingers a node hasshould provide a rough estimate (with accuracy in the order of logN) of the size of the overlay. The current text suggests that thenumber of fingers should be decided based on the overlay size, whichdoes not seem intuitive or necessary.
Leaving (Section 9.9):
----------------------
In the event of a graceful leave, a peer should be able to sendupdates to all members of the neighbor set. It seems redundant tohave the peer send a Leave message, followed by every peer receivingthe Leave to send updates. Is there a reason for this?
Thanks,
Vidya
_______________________________________________
P2PSIP mailing list
[email protected]
https://www.ietf.org/mailman/listinfo/p2psip

_______________________________________________
P2PSIP mailing list
[email protected]
https://www.ietf.org/mailman/listinfo/p2psip

Re: [P2PSIP] Section 9 of RELOAD - Comments

Reply via email to