On May 1, 2009, at 9:35 AM, Matthew Toseland wrote:
IMPLEMENTING IT:
Main tasks:
- Converting our datastore indexes into a compact, slightly more lossy, 1-bit filter. (Easy) (sketched below)
- Creating a snapshot once an hour, and keeping it for some period (1 week?). (Easy)
- Creating an efficient binary diff format for hour-to-hour diffs, and keeping them. (Moderate)
This might actually be quite hard, as an efficient bloom filter will scatter even a few updates all over the filter (but see the XOR sketch below).
- Sending our filters to our darknet peers. (Easy)
- Updating our darknet peers once an hour with the diffs we create anyway. (Easy)
- Recording what version of the indexes we have sent to each darknet peer. (Easy)
- Updating our darknet peers when they reconnect after downtime with all the diffs they missed. (Easy)
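For the 1-bit filter itself, the conversion is only a few lines. A minimal sketch, assuming SHA-256 over the routing key and made-up sizes (BLOOM_BITS, HASHES); the real datastore iteration would obviously look different:

    import java.security.MessageDigest;
    import java.util.BitSet;

    class BloomBuildSketch {
        static final int BLOOM_BITS = 1 << 22;   // 4M bits = 512 KiB; illustrative only
        static final int HASHES = 4;

        // Build a 1-bit bloom filter over the keys currently in the datastore.
        static BitSet buildFilter(Iterable<byte[]> storedKeys) throws Exception {
            BitSet filter = new BitSet(BLOOM_BITS);
            MessageDigest md = MessageDigest.getInstance("SHA-256");
            for (byte[] key : storedKeys) {
                byte[] h = md.digest(key);
                for (int i = 0; i < HASHES; i++) {
                    // Derive each hash function from 4 bytes of the digest.
                    int idx = ((h[4 * i] & 0xff) << 24 | (h[4 * i + 1] & 0xff) << 16
                             | (h[4 * i + 2] & 0xff) << 8 | (h[4 * i + 3] & 0xff))
                             & Integer.MAX_VALUE;
                    filter.set(idx % BLOOM_BITS);
                }
            }
            return filter;
        }
    }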
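As for the diff format: XORing consecutive snapshots gives a BitSet with bits set only where the filter changed, which compresses fine (gzip it, or send the changed bit indices) even though those bits are scattered. Sketch, assuming plain java.util.BitSet snapshots:

    import java.util.BitSet;

    class FilterDiffSketch {
        // Diff = XOR of old and new snapshot: only the changed bits are set,
        // so run-length or gzip encoding stays small if little has changed.
        static BitSet diff(BitSet oldSnapshot, BitSet newSnapshot) {
            BitSet d = (BitSet) newSnapshot.clone();
            d.xor(oldSnapshot);
            return d;
        }

        // Applying a diff to bring an old copy up to date is the same operation.
        static void apply(BitSet filter, BitSet diff) {
            filter.xor(diff);
        }
    }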
IMHO this revision/diff/update system is too exacting a mechanism.
If this is to be used only for failing requests, perhaps a quicker-to-implement & lossy mechanism should be considered.
i.e. a sparse (and decaying?) bloom filter
--
initialize to empty (of course)
choose an update period (1 week, as you mentioned?)
send out 'rolling' updates of our bloom filter
*i.e. start at 'chunk' 0, then 1, then 2, ...
*at such a rate that the whole filter is transmitted once per period (one of its n chunks every period/n)
whenever we get an update from our peer, it overwrites that chunk of the bloom filter
*if we don't get an update from our peer, zero out that chunk
*if we have been offline, zero out the chunks we missed
--
We still have to decide whether to listen to updates at all (i.e. keep our peers' huge bloom filters).
Transmits are WAY easier.
Receives are easy.
After ~ one period the filter becomes usable (starts producing hits).
The downside is that the more a peer is down, the less likely we are to have a usable filter for it... unless, that is, a peer can request a chunk of the bloom filter that it missed/wants (!!?!)
Or even... just make it a pull mechanism! Those peers interested in our bloom filter can request any part of it; but it might cost them request-capital. (A rough sketch of the receive-side chunk bookkeeping follows.)
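Receive-side bookkeeping for the rolling chunks could be as simple as the sketch below; the chunk count, chunk size and bit layout are invented for illustration:

    class RollingFilterSketch {
        static final int CHUNKS = 168;            // e.g. one chunk per hour over a one-week period
        final byte[][] peerFilter = new byte[CHUNKS][];

        // A chunk update arrived from the peer: overwrite that chunk.
        void onChunk(int index, byte[] data) {
            peerFilter[index] = data.clone();
        }

        // No update arrived for this slot (peer silent, or we were offline): forget it.
        void missedChunk(int index) {
            peerFilter[index] = null;
        }

        // A key only counts as a probable hit if the chunk holding its bit is
        // present and the bit is set; unknown chunks never produce hits.
        boolean mightHave(int bitIndex, int bitsPerChunk) {
            byte[] chunk = peerFilter[bitIndex / bitsPerChunk];
            if (chunk == null) return false;
            int offset = bitIndex % bitsPerChunk;
            return (chunk[offset / 8] & (1 << (offset % 8))) != 0;
        }
    }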
- Using any filters we have to send GetOfferedKey-like requests for the data, and handling such requests. (Easy)
This is the core of the enhancement, of course; but it may be easier to just request the data (making it like a turtled request) rather than define a new message. (A minimal requester-side check is sketched below.)
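The requester-side check is trivial; roughly the following, where PeerStub, the single-bit hash and the field names are placeholders rather than real node code:

    import java.util.Arrays;
    import java.util.BitSet;

    class BloomFetchSketch {
        // Before giving up on (or turtling) a failing request, see whether any
        // directly connected peer's filter claims to have the key; if so, ask it first.
        static PeerStub pickPeer(byte[] routingKey, Iterable<PeerStub> peers) {
            for (PeerStub p : peers) {
                if (p.filter != null && p.filter.get(bitFor(routingKey, p.filterBits)))
                    return p;   // probable hit; may still be a false positive
            }
            return null;
        }

        // Must be the same hash the filter was built with; reduced to one bit here.
        static int bitFor(byte[] routingKey, int filterBits) {
            return (Arrays.hashCode(routingKey) & Integer.MAX_VALUE) % filterBits;
        }

        static class PeerStub {
            BitSet filter;     // last filter received from this peer, if any
            int filterBits;    // its size in bits
        }
    }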
- Track success rates for bloom-based fetches. (Optional, can be done later) (Easy)
- Tracking a large number of opennet peers which we have connected to over the last week, recording their total uptime over the week. (Moderate)
- Calculating which have sufficient uptime for sharing bloom filters to be worthwhile. (Easy) (see the uptime sketch below)
- Sending those nodes the bloom filters, keeping them up to date, and recording what version we sent them last. (Easy)
- Limit the total memory usage for Bloom filters, by adding it up and then dropping the biggest filters from active use until we're not over the limit. Tell our peers so they don't send us updates. (Moderate) (see the memory-cap sketch below)
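For the opennet uptime bookkeeping, something along these lines would do; the one-week window and the threshold argument are assumptions:

    import java.util.HashMap;
    import java.util.Map;

    class UptimeSketch {
        static final long WEEK_MS = 7L * 24 * 60 * 60 * 1000;
        final Map<String, Long> connectedMillisThisWeek = new HashMap<>();

        // Called when a peer disconnects (or periodically while it is connected).
        void addUptime(String peerId, long millis) {
            connectedMillisThisWeek.merge(peerId, millis, Long::sum);
        }

        // Only worth sending filters (and diffs) to peers that are around often enough.
        boolean worthSharingWith(String peerId, double minFraction) {
            long up = connectedMillisThisWeek.getOrDefault(peerId, 0L);
            return up >= (long) (minFraction * WEEK_MS);
        }
    }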
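And the memory cap can be as blunt as sorting by size and dropping the largest filters until we fit; the limit and the PeerFilter holder are made up for the sketch:

    import java.util.Comparator;
    import java.util.List;

    class FilterMemoryCapSketch {
        static final long MAX_FILTER_BYTES = 8L * 1024 * 1024;   // arbitrary example limit

        // Drop the biggest peers' filters from active use until the total fits;
        // the caller should then tell those peers to stop sending updates.
        static void enforceLimit(List<PeerFilter> filters) {
            filters.sort(Comparator.comparingLong((PeerFilter f) -> f.sizeBytes).reversed());
            long total = 0;
            for (PeerFilter f : filters) total += f.sizeBytes;
            for (PeerFilter f : filters) {
                if (total <= MAX_FILTER_BYTES) break;
                total -= f.sizeBytes;
                f.active = false;
            }
        }

        static class PeerFilter {
            long sizeBytes;
            boolean active = true;
        }
    }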