Continuous replication stability issues
---------------------------------------

                 Key: COUCHDB-481
                 URL: https://issues.apache.org/jira/browse/COUCHDB-481
             Project: CouchDB
          Issue Type: Bug
          Components: Database Core
    Affects Versions: 0.10
            Reporter: Robert Newson
            Priority: Blocker


I've been trying continuous replication with different combinations of 
push/pull across 2, 3 and 4 nodes. I've hit several problems and discussed them 
on IRC with jan___ and kocolosk.

Firstly, the status page in Futon (and the output of _active_tasks) sometimes 
becomes inaccurate and does not recover. This complicates investigation of 
the more serious problems.

I configured a circle of four nodes with continuous pull replication and used 
'ab' to write documents to the first one. Success means all documents appear 
at all nodes. For small batches of documents, this works. With large numbers, 
it fails every time. I use batch=ok on all requests and have never completed a 
100k-document run successfully.
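
For reference, a setup like the one described could be sketched as follows. 
This is only an illustration, not the commands used in the test: the node URLs 
and the database name "testdb" are hypothetical, and it assumes that each 
node's _replicate endpoint accepts a JSON body with "source", "target" and 
"continuous" fields and that GET on a database returns a "doc_count" field.

```python
import json
import urllib.request

# Hypothetical node URLs and database name; not taken from the report.
NODES = [
    "http://node1:5984",
    "http://node2:5984",
    "http://node3:5984",
    "http://node4:5984",
]
DB = "testdb"


def ring_pull_jobs(nodes, db):
    """Return (host, body) pairs where each node pulls from its predecessor."""
    jobs = []
    for i, node in enumerate(nodes):
        upstream = nodes[i - 1]  # index -1 wraps around to the last node
        body = {"source": f"{upstream}/{db}", "target": db, "continuous": True}
        jobs.append((node, body))
    return jobs


def start_replication(host, body):
    """POST one replication request to the pulling node (assumed endpoint)."""
    req = urllib.request.Request(
        f"{host}/_replicate",
        data=json.dumps(body).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(req) as resp:
        return json.loads(resp.read())


def doc_count(host, db):
    """Read the document count a node reports for the database."""
    with urllib.request.urlopen(f"{host}/{db}") as resp:
        return json.loads(resp.read())["doc_count"]


if __name__ == "__main__":
    for host, body in ring_pull_jobs(NODES, DB):
        start_replication(host, body)
    # After the write load finishes, every node should report the same count.
    print({host: doc_count(host, DB) for host in NODES})
```

Comparing the per-node counts directly avoids relying on the Futon status 
page, which may itself be inaccurate.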

At some point, one of the replication tasks in the circle dumps a huge stack 
trace and dies, and it is not restarted. (kocolosk has seen the trace; I would 
need to sanitize private server names from it before I could post it.) Worse, 
the client process injecting the documents sometimes dies as well.

I have had perfect replication runs with 2 and 3 nodes in a circle, and no 
successful replication runs with 4 nodes. Using a star pattern (where each node 
pulls or pushes to the remaining three) fails even more rapidly.
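
To make the difference between the two layouts concrete, here is a small 
sketch of the pull variant of the star pattern. The node names are 
hypothetical placeholders, and the function only enumerates which node pulls 
from which; it does not contact any server.

```python
def star_pull_pairs(nodes):
    """Return (target, source) pairs where every node pulls from all others."""
    return [(target, source)
            for target in nodes
            for source in nodes
            if source != target]


if __name__ == "__main__":
    nodes = ["node1", "node2", "node3", "node4"]  # hypothetical names
    pairs = star_pull_pairs(nodes)
    # n nodes give n * (n - 1) replication tasks, versus n in a circle.
    print(len(pairs), "replication tasks for", len(nodes), "nodes")
```

With four nodes this gives twelve concurrent replication tasks instead of the 
four in a circle, so the star pattern puts considerably more replication load 
on each node.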





-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.