Re: queue-sync-over-CouchDB

Andreas Gal Tue, 30 Jul 2013 04:23:39 -0700

Not having to carry pouchdb on the client side is definitely tempting.Also, as we discussed earlier, CouchDB's replication algorithm is not aperfect fit for our star-shaped nodes. Its meant for a moreinterconnected graph. A single outgoing changes queue avoids carryingmore history on the client than needed in the star-shaped graph.

A second benefit is that this model fits pretty well with the existinginterfaces we have in the browser for the datatypes we are talking abouthere. We have observers and mutators on all of them, and we can cheaplyfeed from observers into the table of local changes.

On the flipside, this is a lot of new code to write and get right, andits exactly the kind of tricky complex distributed state machine thatwill take the most time to debug and get ready for production. Also, ifwe implement our own replication mechanism, what is the advantage ofsticking with the CouchDB wire protocol? Its actually rather clumsy andinefficient (see proposed jsondiff delta compression). I am not arguingfor using something else than CouchDB. I am merely asking why you thinkit makes sense to stick with the wire protocol but abandon the higherlevel semantics of CouchDB.


Andreas

Brian Warner wrote:

Chris and I have been sketching out what our queue-sync idea[1] wouldlook like when run over the CouchDB API. The rough initial writeup ishere:
 https://wiki.mozilla.org/Identity/CryptoIdeas/06-Queue-Sync-CouchDB

(with some even rougher notes on an etherpad[2]).
It lacks rigor, but should be enough to see where it's headed. Thebasic idea is to use couch's "POST _bulk_docs" API (which is usedinternally by the CouchDB replication machinery) to deliver batches ofnew records to the server, some of which will be accepted, otherswhich will be rejected (due to other clients delivering their ownchanges first). We use the "GET _changes" API to learn about allserver changes, both reflections of our own, and those from otherclients. New changes are delivered to the local Provider (aka engine)for merging into Places.db/etc.
The "Mediator" is responsible for crypto, batching changes intoefficient bundles, all network traffic, and maintains a "revisiontable". This table maps locally-generated "content-revisions" toserver-generated "server-revisions", keeping them isolated fromservers and local Providers respectively. These revisions help providethe previous-version value used by compare-and-swap to reject newrecords that aren't based upon the server's previous version (think hgor git push failing because you aren't up-to-date).
This doesn't use the couch replication system (POST /_replicate), nordoes it embed a copy of CouchDB/PouchDB in the browser. It just usescouch on the server, and speaks the couch API. This seems like adecent way to get the benefits of a well-tested API and serverimplementation, without taking on the code-size or runtime costs ofhaving a full CouchDB instance inside the browser.
Let us know what you think!
 -Brian


[1]: https://wiki.mozilla.org/Identity/CryptoIdeas/05-Queue-Sync
[2]: https://id.etherpad.mozilla.org/picl-couchdb-queuesync-notes?
_______________________________________________
Sync-dev mailing list
[email protected]
https://mail.mozilla.org/listinfo/sync-dev

_______________________________________________
Sync-dev mailing list
[email protected]
https://mail.mozilla.org/listinfo/sync-dev

Re: queue-sync-over-CouchDB

Reply via email to