Re: handling simultaneous identical replications

Adam Kocoloski Thu, 05 Mar 2009 05:01:27 -0800

On Mar 5, 2009, at 7:24 AM, Jan Lehnardt wrote:

On 5 Mar 2009, at 07:31, Paul Davis wrote:
On Wed, Mar 4, 2009 at 8:34 PM, Adam Kocoloski <[email protected]> wrote:
Hi folks, we've been running into a problem where multiplereplications withthe same source and target are running simultaneously. Thisintroducesquite a lot of unnecessary network traffic and causes realproblems withupdate collisions on the local replication history documents. IfreplicatorA updates the source doc and replicator B updates the target doc,subsequent
replications will decide that a full replication is necessary.
I have some ideas about how to ensure only one is running at atime (more onthat in a separate mail), but I'd like some feedback on how tohandle thesecond..Nth request. Let's call the initial POST to _replicate"A" and the
second POST "B":

Option 1 -- Respond to B with the results from A
This option works fine if the source is remote. However, if thesource islocal, the replication started by A will be missing updates to thesource DB
that occurred between A and B.  B may be surprised by that result.

Option 2 -- Grab an updated DB and continue the replication
This option will include updates to the source that occurredbetween A and B
in the response to both requests.

Option 3 -- Respond to A, then trigger another replication for B
In this case we wait till the replication started by A hascompleted, thendo an incremental one and respond to B with the results of thatincremental.
I think I'd vote for 3.  Cheers, Adam
If I follow this correctly, the issue is, "POST to _replicate, a
second POST to _replicate occurs before the first request finishes"
(with the same source/target info).

My knowledge of replication is only cursory, but I could also see:

Option 4:

Same as views, we wait for replication to finish and return the same
result to all clients that made a request.
I understand this and Adam's option 3 to be the same. What am Imissing? :)

No, not quite. In Option 3 the two requesters get differentresponses. A gets the result of the original request, B gets theresult of the replication triggered automatically after the first onethat replicates any updates to the DB which happened during the firstpass. If no updates occurred, B will receive the result of the firstreplication.

Paul's Option 4 is more like Options 1 and 2, where A and B getidentical responses. The difference between 1 and 2 is just whethernew updates get included in that response.


Whew.

Option 5:

Return an error on B that says, "Yeah, yeah. Already on it."


This would make replication behave a bit like compaction.

Sort of, in that additional triggers are no-ops. Option 1 also hasthat behavior.

I think I like 3/4 best.

Cheers
Jan
--



Best, Adam

Re: handling simultaneous identical replications

Reply via email to