On Jan 4, 2008, at 2:30 PM, Matthew Toseland wrote:
> On Friday 04 January 2008 18:32, Robert Hailey wrote:
>> Apparently, until this revision (16886), a node will take as long as
>> necessary to exhaust its routable peers (so long as no single node
>> times out), even long after the original requestor has given up on
>> that node.
>
> Yes. Is this bad? Obviously there are limits - if it gets a
> post-accepted timeout on any one node it will finish the request.
>
> Generally I think this is probably a good thing - the data is wanted,
> so why not find it? It will be cached and will be transferred later.
> With ULPRs it will even be transferred when we complete, despite the
> timeout.
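
Just so we are describing the same behaviour, here is the shape of the
loop as I understand it (a sketch with invented names, not the actual
RequestSender code, and the timeout value is a placeholder): the only
early exit is a timeout after a peer has ACCEPTED; nothing looks at the
total age of the request, or at whether the node that sent it to us
still cares.

    // Sketch only; the names are invented for illustration. The loop keeps
    // trying the next-closest routable peer until peers are exhausted, and
    // the only early exit is a timeout after a peer has ACCEPTED. The total
    // age of the request, and whether the upstream node has given up, are
    // never consulted.
    import java.util.Set;

    abstract class RequestLoopSketch {
        interface Key {}
        interface PeerNode {}

        enum Outcome { DATA_FOUND, TIMED_OUT_AFTER_ACCEPTED, REJECTED_OR_FAILED }

        static final long PER_PEER_TIMEOUT_MILLIS = 120000L; // assumed value

        abstract PeerNode closestRoutablePeer(Key key, Set<PeerNode> tried);          // hypothetical
        abstract Outcome forwardAndWait(PeerNode peer, Key key, long timeoutMillis);  // hypothetical

        void routeRequest(Key key, Set<PeerNode> tried) {
            while (true) {
                PeerNode next = closestRoutablePeer(key, tried);
                if (next == null) return;                           // routable peers exhausted
                tried.add(next);
                Outcome o = forwardAndWait(next, key, PER_PEER_TIMEOUT_MILLIS);
                if (o == Outcome.DATA_FOUND) return;                // success
                if (o == Outcome.TIMED_OUT_AFTER_ACCEPTED) return;  // the only other exit
                // rejected / looped / transfer failed: move on to the next peer
            }
        }
    }
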
So in the best case, the timeout value was just wrong enough for the
overall network topology towards this address. In the worst case, the
data does not exist (e.g. looking for a newer USK, or the next Frost
message)... which might actually be the common case.

In either case, resuming a request after we know that the upstream
peer has forgotten about it could be very bad. Assuming 20 peers (a la
opennet), the theoretical worst case per node is that the last new
request will leave the node about 40 minutes from when it entered the
node. To the best of my knowledge, none of the upstream nodes will
respond with the LOOP rejection before then. And even well before the
worst case, this effect can accrue across many nodes in the path.
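
For clarity, the back-of-the-envelope arithmetic behind that "about 40
minutes" (the ~2 minute per-peer figure is my own round-number
assumption, not a value taken from the code):

    // Rough worst-case residence of a request on a single node, assuming
    // (hypothetically) ~2 minutes spent on each peer before giving up on
    // it, and 20 routable opennet peers. The per-peer figure is my
    // assumption, not something from the tree.
    public class WorstCaseResidency {
        public static void main(String[] args) {
            int routablePeers = 20;
            double perPeerTimeoutMinutes = 2.0; // assumed
            double worstCaseMinutes = routablePeers * perPeerTimeoutMinutes;
            System.out.println("worst case per node: " + worstCaseMinutes + " minutes"); // ~40
        }
    }
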
>> I suspect good and bad from this. On the one hand, all nodes will
>> become much less busy (increase throughput, decrease latency), but
>> the data may not be found at all (as presently the node may continue
>> to search all its peers and cache it for next time).
>
> Right, the advantage of the current system is that the data will
> probably be found eventually, and the next time the request is made
> it will be available before it times out. But if it's hard to find,
> if we time out as you suggest, it may not be found.
I think the more correct solution to that problem is to route less to
nodes which take so long, or to choose them last. Which is to say, if
a certain node (or the path to a node) is making us time out, avoid
it, to the benefit of overall network health and responsiveness.
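
Something like the sketch below is what I have in mind. None of these
names exist in the tree, and the threshold is made up; it just orders
candidate peers by closeness to the key, but pushes peers with a poor
recent response time to the back of the list:

    // Sketch of "route less to nodes which take so long, or choose them
    // last". All names are invented for illustration; the slow threshold
    // is arbitrary.
    import java.util.ArrayList;
    import java.util.Collections;
    import java.util.Comparator;
    import java.util.List;

    public class SlowPeerAwareRouting {
        static final long SLOW_THRESHOLD_MILLIS = 30000L; // assumed cutoff

        interface PeerNode {
            double getLocation();                // position in [0,1)
            long recentAverageResponseMillis();  // hypothetical stat
        }

        // Circular distance on the [0,1) keyspace.
        static double distance(double a, double b) {
            double d = Math.abs(a - b);
            return Math.min(d, 1.0 - d);
        }

        static List<PeerNode> orderCandidates(final double target, List<PeerNode> peers) {
            List<PeerNode> candidates = new ArrayList<PeerNode>(peers);
            Collections.sort(candidates, new Comparator<PeerNode>() {
                public int compare(PeerNode a, PeerNode b) {
                    boolean aSlow = a.recentAverageResponseMillis() > SLOW_THRESHOLD_MILLIS;
                    boolean bSlow = b.recentAverageResponseMillis() > SLOW_THRESHOLD_MILLIS;
                    if (aSlow != bSlow) return aSlow ? 1 : -1; // slow peers last
                    return Double.compare(distance(a.getLocation(), target),
                                          distance(b.getLocation(), target));
                }
            });
            return candidates;
        }
    }
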
> The only real reason I can think of to time out as you do below would
> be for load balancing fairness: If you make a request, you had better
> be prepared (bandwidth-wise) to accept the resulting data; if you're
> not, that's a denial of service attack. Load management relies on
> propagating the load back to the requestor. But the requestor can
> have only a very limited influence on timing out here, so I don't
> consider it to be valid.
Perhaps related to this is the furthestStoreSuccess stat, which will
usually near-instantly stick to ~0.5, sometimes exactly 0.5... which
means that (wherever the request came from) my node was the absolute
farthest node possible, last on its list to call, and all my peers
were closer. But weirdest of all, the data was actually in the store.
More evidence (IMHO) for store-position-bias :) Intuitively, we should
be near the data the network sends us, and not search the whole
network for the data we want.
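
For reference, the distance I mean is the usual circular distance on
the [0,1) location space, which maxes out at 0.5; the "am I the
farthest" check below is only my guess at what the stat is effectively
capturing:

    // Circular distance on the [0,1) keyspace (this part is standard);
    // the farthest-of-all check is my own illustration, not how the stat
    // is actually computed.
    public class FarthestCheck {
        static double distance(double a, double b) {
            double d = Math.abs(a - b);
            return Math.min(d, 1.0 - d); // wraps around the circle, max 0.5
        }

        static boolean iAmFarthest(double key, double myLocation, double[] peerLocations) {
            double mine = distance(myLocation, key);
            for (double p : peerLocations)
                if (distance(p, key) >= mine) return false; // some peer is at least as far
            return true;
        }

        public static void main(String[] args) {
            // At a distance of 0.5 a node is as far from the key as the
            // circle allows, which is why a value pinned at ~0.5 is odd.
            System.out.println(distance(0.1, 0.6)); // prints 0.5
        }
    }
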
--
Robert Hailey
_______________________________________________
Devl mailing list
[email protected]
http://emu.freenetproject.org/cgi-bin/mailman/listinfo/devl