Re: replication using _changes API

Adam Kocoloski Fri, 12 Jun 2009 06:00:20 -0700

Hi Damien, I'm not sure I follow. My worry was that, if I built a replicator which only queried _changes to get the list of updates, I'd have to be prepared to process a very large response. I thought one smart way to process this response was to throttle the download at the TCP level by putting the socket into passive mode.

I agree that the HTTP client seems to be at fault, because the option that it exposes to switch to passive mode seems to be a no-op. What exactly did you mean by "streams the data while not buffering the data"?

Best,
Adam

On Jun 12, 2009, at 8:03 AM, Damien Katz wrote:

I don't think this is TCP's fault; it's the HTTP client. We need an HTTP client that streams data while not buffering the data (low-level TCP already buffers some), instead of sending all the data that comes in to the waiting process, essentially buffering everything.
-Damien


On Jun 11, 2009, at 4:14 PM, Adam Kocoloski wrote:
I had some time to work on a replicator that queries _changes instead of _all_docs_by_seq today. The first question that came to my mind was how to put a spigot on the firehose. If I call _changes without a "since" qs parameter on a 10M document DB I'm going to get 10M chunks of output back.
I thought I might be able to control the flow at the TCP socket level using the inets HTTP client's {stream,{self,once}} option. I still think this would be an elegant option if I can get it to work, but my early tests show that all the chunks still show up immediately in the calling process regardless of whether I stream to self or {self,once}.
All for now, Adam
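
The flow-control idea discussed in this thread (a receiver that only reads when it is ready, so that TCP itself throttles the sender) can be illustrated with a small sketch. This is not the inets HTTP client and not Erlang; it is a Python illustration using a local socket pair, and the function names (fill_until_blocked, drain, demo) are hypothetical. It assumes the kernel's socket buffers are much smaller than the 64 MB limit used here, which holds with typical default settings.

```python
import socket


def fill_until_blocked(sock, chunk, limit):
    """Send chunk repeatedly without blocking; return the bytes the kernel accepted."""
    sent = 0
    while sent < limit:
        try:
            sent += sock.send(chunk)
        except BlockingIOError:
            break  # kernel buffers are full: the sender is being throttled
    return sent


def drain(sock, bufsize=65536):
    """Read everything currently buffered on sock; return the byte count."""
    received = 0
    while True:
        try:
            data = sock.recv(bufsize)
        except BlockingIOError:
            break  # nothing left to read right now
        if not data:
            break  # peer closed the connection
        received += len(data)
    return received


def demo(limit=64 * 1024 * 1024):
    """Show that an unread socket stalls the sender, and that reading releases it."""
    sender, receiver = socket.socketpair()
    sender.setblocking(False)
    receiver.setblocking(False)
    try:
        chunk = b"x" * 4096
        first = fill_until_blocked(sender, chunk, limit)   # receiver idle
        read = drain(receiver)                             # receiver catches up
        second = fill_until_blocked(sender, chunk, limit)  # sender may continue
        return first, read, second
    finally:
        sender.close()
        receiver.close()


if __name__ == "__main__":
    first, read, second = demo()
    print("accepted before stall:", first)
    print("read by receiver:", read)
    print("accepted after drain:", second)
```

While the receiver does not read, the sender stalls once the kernel buffers fill, well short of the limit. If a client library instead reads eagerly and forwards every chunk to the waiting process, that back-pressure is lost and the process ends up buffering everything, which is the behavior described above.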
