Re: [Sequoia] PostgreSQL Documentation of High Availability and Load Balancing

Emmanuel Cecchet Wed, 22 Nov 2006 03:11:10 -0800

Markus,

Okay, I have forwarded that to Bruce, who's editing the documentation(and is a native English speaker). I'm not sure how we can cover this,as we are very general in our description.
You might want to recheck the paragraph, which is now called"Synchronous Multi-Master Replication":
http://momjian.us/main/writings/pgsql/sgml/high-availability.html

Ok, thanks I'll have a look at this on Friday.

I'm particularly unsure, where Sequoia would fit in. There is stillthe split between "Statement-Based Replication Middleware" and"Synchonous Multi-Master Replication".
Does Sequoia offer any form of async replication?

Yes. We provide to the user a consistent view of data as synchronousreplication would do, but behind the scene we allow asynchrony betweenbackends and results can be returned as soon as a backend hassuccessfully completed a query execution.

Yes but don't underestimate the capability of a single node toexecute transactions in parallel as well. Oftentimes sending 2concurrent transactions to a single node or to 2 different nodes doesnot make any difference (obviously it depends on the nature of thetransaction).
Okay, I just have to believe that. Up until now I'm mostly basing ontheoretical estimates, rather than hard facts. :-) You seem to havemade some real benchmarks. Did you publish them?

You can look at slides #63 & 64 inhttp://www.continuent.org/uploads/sequoia/Resources/2006-08-15Cecchet_ApacheConAsia2006.pdfIt shows the response time with a single database and with a cluster of2 dbs. Response time remains constant until you go beyond the sweet spotof a single node.We did not publish any official results with hard numbers. Note that youcan always build/tune a benchmark to show what you want to show. Sothere is nothing better than testing with your application and see foryourself. We have few recipes for performance but they are part of theexpertise we use for our commercial services.

May I ask what you use for automatic testing and benchmarking? I'mcurrently stuck testing 90% of the time. Starting up the GCS, twoDatabases and attaching the debugger to every process which couldpossibly go havoc really takes a F***ING lot of time!

We have developed a complete automated distributed testing framework(probably more code there than in the software to test itself!).Software, databases and clients are deployed automatically with theproper configuration, tests are run and outputs are captured fordiagnosis. On the failover side, some scenarios can be automaticallygenerated (about 7000 failover scenarios are possible with a simple MSTin a 4 nodes cluster ...).On the benchmarking side we use a couple of different benchmarks thatare available in the open source. It is good to have bothmicrobenchmarks and also more 'real' macro workloads such as TPC-*.This whole test infrastructure is proprietary code that we have notdecided yet to open source.


Best regards,
Emmanuel

--
Emmanuel Cecchet
Chief Scientific Officer, Continuent

Blog: http://emanux.blogspot.com/
Open source: http://www.continuent.org
Corporate: http://www.continuent.com
Skype: emmanuel_cecchet
Cell: +33 687 342 685


_______________________________________________
Sequoia mailing list
[email protected]
https://forge.continuent.org/mailman/listinfo/sequoia

Re: [Sequoia] PostgreSQL Documentation of High Availability and Load Balancing

Reply via email to