On 02/21/2011 11:18 AM, Wayne Conrad wrote:
> I'm seeing replication behavior that I don't understand. I wonder if
> it's stalled.
It's not stalled. It's going very, very slowly. I think I understand why.
Some of my documents have tens of thousands of attachments. When I
first started storing these fat documents in CouchDB, it took half an
hour or more to add each one. To make it faster, and to prevent
timeouts, I now store the attachments inline, in chunks of 100 at a
time. Done that way, even my largest documents take only a minute or
so to store.
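A minimal sketch of that chunking idea, assuming attachments arrive as a name => binary-data hash; the chunk size, content type, and helper name are illustrative, not my actual code:

```ruby
require 'base64'

CHUNK_SIZE = 100  # attachments per document update (illustrative)

# Split a { name => data } hash into _attachments-shaped chunks, each
# ready to be merged into the document and saved with one PUT.
def inline_attachment_chunks(attachments, chunk_size = CHUNK_SIZE)
  attachments.each_slice(chunk_size).map do |slice|
    slice.to_h do |name, data|
      [name, { 'content_type' => 'application/octet-stream',
               'data' => Base64.strict_encode64(data) }]
    end
  end
end

# Each chunk would then be merged and the doc re-PUT, roughly:
#   doc['_attachments'] ||= {}
#   doc['_attachments'].merge!(chunk)
#   PUT {db}/{doc['_id']}?rev={doc['_rev']} with the doc body
```

Each PUT re-sends the document body, but the already-stored attachment stubs stay on the server, so only the new chunk's data crosses the wire.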
I can store a document with 32,768 attachments of 4k each in 55 seconds
(about 2.4 MB/sec). But replicating that same document (using "pull"
replication) takes 19.5 minutes, about 115 KB per second. Storing,
then, is roughly 20 times faster than replicating. When I look at the
log on the source database, I see that the destination is retrieving
one attachment at a time, and (I presume) hitting the same speed
problem that led me to write my "store bunches of attachments at a
time" optimization. It now seems that, for replication to have any
chance of keeping up with the rate at which I can store data, I'll
need the same sort of optimization during replication.
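The back-of-the-envelope arithmetic behind those figures (taking 4k = 4096 bytes):

```ruby
# 32,768 attachments x 4 KiB each = 128 MiB of attachment data.
total_bytes = 32_768 * 4 * 1024

store_rate = total_bytes / 55.0          # ~2.4 MB/sec when storing
repl_rate  = total_bytes / (19.5 * 60)   # ~115 KB/sec when replicating
slowdown   = store_rate / repl_rate      # ~21x slower to replicate
```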
I'm a couch toddler, and when it comes to Erlang, I'm not even on solid
food yet. What are the odds of me writing my own replication engine in,
say, Ruby, one that can do the special optimizations I need? How
difficult a project is it?
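For concreteness, here's a rough sketch of the shape such a Ruby pull replicator might take, using only the standard CouchDB HTTP endpoints (_changes, a document GET with ?attachments=true to inline attachments as base64, and _bulk_docs with new_edits=false to preserve revisions). It deliberately skips checkpoints, conflicts, and deletions, and all names are mine:

```ruby
require 'json'
require 'net/http'
require 'uri'

def changes_url(source, since)
  URI("#{source}/_changes?since=#{since}")
end

def doc_url(source, id, rev)
  # attachments=true inlines attachment bodies as base64 in the doc.
  URI("#{source}/#{id}?rev=#{rev}&attachments=true")
end

def bulk_docs_payload(docs)
  # new_edits=false stores the docs under their existing _rev values
  # instead of generating new revisions -- essential for replication.
  { 'docs' => docs, 'new_edits' => false }.to_json
end

def pull_replicate(source, target, since = 0)
  changes = JSON.parse(Net::HTTP.get(changes_url(source, since)))
  changes['results'].each do |row|
    rev = row['changes'].first['rev']
    doc = JSON.parse(Net::HTTP.get(doc_url(source, row['id'], rev)))
    # One POST per doc here; batching several docs per _bulk_docs call
    # is where a "bunches at a time" optimization would go.
    Net::HTTP.post(URI("#{target}/_bulk_docs"), bulk_docs_payload([doc]),
                   'Content-Type' => 'application/json')
  end
  changes['last_seq']
end
```

Whether this is wise is another question: a home-grown replicator has to get revision handling and conflict semantics right, which is exactly what the built-in one already does.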