Re: [HACKERS] Load Distributed Checkpoints test results

Heikki Linnakangas Wed, 20 Jun 2007 11:02:28 -0700

I've uploaded the latest test results to the results page at http://community.enterprisedb.com/ldc/

The test results on the index page are not in a completely logical order, sorry about that.

I ran a series of tests with 115 warehouses, and no surprises there. LDC smooths the checkpoints nicely.

Another series with 150 warehouses is more interesting. At that number of warehouses, the data disks are 100% busy according to iostat. The 90th percentile response times are somewhat higher with LDC, though the variability in both the baseline and LDC test runs seems to be pretty high. Looking at the response time graphs, even with LDC there are clear checkpoint spikes, but they're much less severe than without.
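For anyone who wants to redo this comparison from raw response-time samples, here is a minimal sketch of how a 90th percentile per run, and the spread across repeated runs, could be computed. The function names and the nearest-rank method are assumptions made for illustration; the test kit may compute its percentiles differently.

```python
import math
import statistics


def percentile_nearest_rank(samples, pct):
    """Return the pct-th percentile of samples using the nearest-rank method.

    Hypothetical helper for illustration; not taken from any test kit.
    """
    if not samples:
        raise ValueError("need at least one sample")
    if not 0 < pct <= 100:
        raise ValueError("pct must be in (0, 100]")
    ordered = sorted(samples)
    # 1-based rank of the smallest value that covers pct percent of samples.
    rank = math.ceil(pct * len(ordered) / 100)
    return ordered[rank - 1]


def summarize_runs(runs, pct=90):
    """Compute the percentile for each run plus mean and spread across runs."""
    per_run = [percentile_nearest_rank(run, pct) for run in runs]
    return {
        "per_run": per_run,
        "mean": statistics.mean(per_run),
        # The sample standard deviation needs at least two runs.
        "stdev": statistics.stdev(per_run) if len(per_run) > 1 else 0.0,
    }
```

If the standard deviation across repeated runs of the same configuration is comparable to the gap between the baseline and LDC means, the difference in 90th percentile response time should not be read as significant.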

Another series was with 90 warehouses, but without think times, driving the system to full load. LDC seems to smooth the checkpoints very nicely in these tests.


Heikki Linnakangas wrote:
> Gregory Stark wrote:
>> "Heikki Linnakangas" <[EMAIL PROTECTED]> writes:
>>> Now that the checkpoints are spread out more, the response times are
>>> very smooth.
>>
>> So obviously the reason the results are so dramatic is that the
>> checkpoints used to push the i/o bandwidth demand up over 100%. By
>> spreading it out you can see in the io charts that even during the
>> checkpoint the i/o busy rate stays just under 100% except for a few
>> data points.
>>
>> If I understand it right Greg Smith's concern is that in a busier
>> system where even *with* the load distributed checkpoint the i/o
>> bandwidth demand during the checkpoint was *still* being pushed over
>> 100% then spreading out the load would only exacerbate the problem by
>> extending the outage.
>>
>> To that end it seems like what would be useful is a pair of tests with
>> and without the patch with about 10% larger warehouse size (~ 115)
>> which would push the i/o bandwidth demand up to about that level.
>
> I still don't see how spreading the writes could make things worse, but
> running more tests is easy. I'll schedule tests with more warehouses
> over the weekend.
>
>> It might even make sense to run a test with an outright overloaded to
>> see if the patch doesn't exacerbate the condition. Something with a
>> warehouse size of maybe 150. I would expect it to fail the TPCC
>> constraints either way but what would be interesting to know is
>> whether it fails by a larger margin with the LDC behaviour or a
>> smaller margin.
>
> I'll do that as well, though experiences with tests like that in the
> past have been that it's hard to get repeatable results that way.




--
  Heikki Linnakangas
  EnterpriseDB   http://www.enterprisedb.com

