On 17/05/2009, at 12:09 AM, Adam Kocoloski wrote:
So, I think there's still some confusion here. By "open connections" do you mean TCP connections to the source? That number is never higher than 10. ibrowse does pipeline requests on those 10 connections, so there could be as many as 1000 simultaneous HTTP requests. However, those requests complete as soon as the data reaches the ibrowse client process, so in fact the number of outstanding requests during replication is usually very small. We're not doing flow control at the TCP socket layer.
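For reference, those caps are per-host ibrowse options. A minimal sketch (the option names are ibrowse's real API; the URL is illustrative, and the pipeline depth of 100 is just the value implied by 10 x 100 = 1000):

    %% 10 connections x 100 pipelined requests = up to 1000 in-flight
    %% HTTP requests against the source host.
    Options = [{max_sessions, 10},        % at most 10 TCP connections per host
               {max_pipeline_size, 100},  % pipelined requests per connection
               {stream_to, self()}].      % deliver the response asynchronously
    {ibrowse_req_id, ReqId} =
        ibrowse:send_req("http://source.example.com:5984/db/doc",
                         [], get, [], Options).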
OK, I understand that now. That means that a document with > 1000 attachments can't be replicated, because ibrowse will never send ibrowse_async_headers for the excess attachments to attachment_loop, which needs to happen for every attachment before any of the data is read by doc_flush_binaries. Which is to say: every document attachment needs to start, i.e. receive its headers, before any attachment bodies are consumed.
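To make that concrete, here is a simplified sketch of the per-attachment receiver (the message shapes are ibrowse's async protocol; the structure is an approximation of attachment_loop, not the actual replicator code):

    %% Each attachment spawns a receiver that blocks here until ibrowse
    %% actually dispatches its request and the response headers arrive.
    attachment_receiver(ReqId) ->
        receive
            {ibrowse_async_headers, ReqId, _Status, _Headers} ->
                collect_body(ReqId)
        end.

    %% Body chunks are only drained later, when doc_flush_binaries pulls
    %% them. With > 1000 attachments the excess requests never leave the
    %% ibrowse queue, so their receivers block in the receive above forever.
    collect_body(ReqId) ->
        receive
            {ibrowse_async_response, ReqId, Data} ->
                [Data | collect_body(ReqId)];
            {ibrowse_async_response_end, ReqId} ->
                []
        end.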
With concurrent replications the maximum number of attachments is reduced, and it's possible to get a deadlock where the ibrowse queue is full but no document has all of its attachment downloads started.
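For example, with a capacity of 1000 in-flight requests, two concurrent replications each fetching a document with 600 attachments could each get roughly 500 requests accepted before the queue fills: neither document has all of its attachment downloads started, so neither can ever begin draining bodies, and both replications stall.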
I'm not sure I understand what part is "not scalable". I agree that ignoring the attachment receivers and their mailboxes when deciding whether to checkpoint is a big problem. I'm testing a fix for that right now. Is there something else you meant by that statement?

Best,
I didn't know about the ibrowse pool, so that part is scalable, i.e. the number of connections and requests is bounded. If my comments above are correct, though, the current architecture isn't scalable with respect to the number of attachments per document in the single-replicator case, and the limit is a more complicated equation in the multiple-replicator case.
P.S. One issue in my mind is that we only do the checkpoint test after we receive a document. We could end up in a situation where a document request is sitting in a pipeline behind a huge attachment, and the checkpoint test won't execute until the entire attachment is downloaded into memory. There are ways around this, e.g. using ibrowse:spawn_link_worker_process/2 to bypass the default connection pool for attachment downloads.
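Something like this, using ibrowse's documented direct-worker API (host, port and URL here are illustrative):

    %% Open a dedicated connection outside the default pool, so a large
    %% attachment download can't sit in a pipeline ahead of document requests.
    {ok, Conn} = ibrowse:spawn_link_worker_process("source.example.com", 5984).
    {ibrowse_req_id, ReqId} =
        ibrowse:send_req_direct(Conn,
                                "http://source.example.com:5984/db/docid/attname",
                                [], get, [], [{stream_to, self()}]).
    %% ... receive the ibrowse_async_* messages as usual, then tear down:
    ibrowse:stop_worker_process(Conn).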
Requiring every attachment to be started but not completed seems to me to be a fundamental issue.
In my case, I have some large attachments and unreliable links, so I'm partial to a solution that allows progress on partial attachments across link failures. We could get this by not delaying the attachment downloads, buffering them to disk, and using Range requests on the GET to resume partial downloads. This would solve several problems because it starts from the requirement to always make progress and never redo work. It seems like it could be done reasonably transparently just by modifying the attachment download code.
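A minimal sketch of that idea, assuming the source honours byte-range requests (the module, function and file names are illustrative, not existing replicator code):

    -module(att_spool).
    -include_lib("kernel/include/file.hrl").
    -export([fetch/2]).

    %% Fetch Url into SpoolFile, resuming from whatever is already on disk.
    fetch(Url, SpoolFile) ->
        Offset = case file:read_file_info(SpoolFile) of
                     {ok, #file_info{size = Size}} -> Size;
                     _ -> 0
                 end,
        Range = lists:concat(["bytes=", Offset, "-"]),
        {ok, Fd} = file:open(SpoolFile, [append, raw, binary]),
        {ibrowse_req_id, ReqId} =
            ibrowse:send_req(Url, [{"Range", Range}], get, [],
                             [{stream_to, self()}]),
        stream_to_disk(ReqId, Fd).

    %% A real implementation would check for a 206 Partial Content status
    %% before appending, and truncate the spool file if it gets a 200.
    stream_to_disk(ReqId, Fd) ->
        receive
            {ibrowse_async_headers, ReqId, _Status, _Headers} ->
                stream_to_disk(ReqId, Fd);
            {ibrowse_async_response, ReqId, Data} ->
                ok = file:write(Fd, Data),
                stream_to_disk(ReqId, Fd);
            {ibrowse_async_response_end, ReqId} ->
                file:close(Fd)
        end.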
Antony Blakey
-------------
CTO, Linkuistics Pty Ltd
Ph: 0438 840 787

Nothing is really work unless you would rather be doing something else.
  -- J. M. Barrie
