On Sun, 26 Aug 2007, Kevin Grittner wrote:

 usagecount | count | isdirty
------------+-------+---------
         0 |  8711 | f
         1 |  9394 | f
         2 |  1188 | f
         3 |   869 | f
         4 |   160 | f
         5 |   157 | f


Here's a typical sample from your set. Notice how you've got very few buffers with a high usage count. This is a situation the background writer is good at working with. Either the old or the new work-in-progress LRU writer can aggressively pound away at any of the buffers with a 0 usage count shortly after they get dirty, and that isn't inefficient, because there aren't large numbers of other clients using them.
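To make the usage-count mechanics concrete, here's a toy sketch of the clock-sweep style replacement PostgreSQL uses, showing why buffers at usage count 0 are the cheap targets. The names (Buffer, clock_sweep) and the cap of 5 are illustrative simplifications, not the real backend API:

```python
from dataclasses import dataclass

@dataclass
class Buffer:
    usage_count: int   # bumped (up to 5 in PostgreSQL) each time a backend uses the page
    is_dirty: bool

def clock_sweep(buffers, start=0):
    """Advance around the buffer ring; decrement non-zero usage counts
    and return the index of the first buffer found at zero."""
    i = start
    while True:
        buf = buffers[i]
        if buf.usage_count == 0:
            return i                # cheap candidate: nobody used it recently
        buf.usage_count -= 1        # recently used pages survive another lap
        i = (i + 1) % len(buffers)

pool = [Buffer(usage_count=c, is_dirty=True) for c in (0, 3, 1, 5)]
print(clock_sweep(pool))  # 0 -- the usage-count-0 buffer is picked immediately
```

With a profile like yours, where most buffers sit at 0 or 1, the sweep finds candidates almost immediately, which is why the LRU writer can be aggressive without wasting work.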

Compare against this other sample:

 usagecount | count | isdirty
------------+-------+---------
         0 |  9093 | f
         1 |  6702 | f
         2 |  2267 | f
         3 |   602 | f
         4 |   428 | f
         5 |  1388 | f

Notice that you have a much larger number of buffers where the usage count is 4 or 5. The all-scan part of the 8.2 background writer wastes a lot of writes when you have a profile more like this. If 4+ client backends have touched a buffer recently, you'd be crazy to write it out right now when you could instead be focusing on banging out the ones where the usage count is 0. The 8.2 background writer wrote them out anyway, which meant that by the time you hit a checkpoint, both the OS and the controller cache were already filled with such buffers before you even started writing the checkpoint data. The new setup in 8.3 only worries about the high-usage-count buffers when you hit a checkpoint, at which point it streams them out over a longer, adjustable period (so as not to spike the I/O more than necessary and block your readers), instead of dumping them all immediately as the 8.2 design did.
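The pacing math behind that "longer, adjustable period" is simple to sketch. Assuming a knob that names what fraction of the checkpoint interval the writes should be spread across (the function and parameter names here are mine, not the real GUC or backend code), the scheduler just assigns each dirty buffer an evenly spaced deadline:

```python
def checkpoint_write_schedule(n_dirty, interval_s, completion_target):
    """Return evenly spaced deadlines (seconds into the checkpoint
    interval) for writing each dirty buffer, so the writes finish by
    interval_s * completion_target instead of all landing at once."""
    window = interval_s * completion_target
    return [window * (i + 1) / n_dirty for i in range(n_dirty)]

# 4 dirty buffers, 300 s between checkpoints, finish by 50% of the interval:
print(checkpoint_write_schedule(4, 300, 0.5))  # [37.5, 75.0, 112.5, 150.0]
```

The 8.2 behavior corresponds to collapsing the whole schedule to time zero, which is exactly the I/O spike the 8.3 design is avoiding.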

Just to be sure that I understand, are you saying it would be a bad scene if
the physical writes happened, or that the overhead of pushing them out to
the OS would be crippling?

If you have a lot of buffers where the usage count is high, it would be problematic to write them out every time they were touched; odds are good somebody else is going to dirty them again soon, so why bother? On your workload, that doesn't seem to be the case. But it is the situation on some other test workloads, and balancing for that situation has been central to the parts of the redesign I've been injecting suggestions into. One of the systems I was tormented by had a usage count of 5 on >20% of the buffers in the cache under heavy load, and had a physical write been executed every time one of those was touched, that would have been crippling (even if the OS were smart enough to cache, and therefore make redundant, some of the writes, which is behavior I would prefer not to rely on).
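A toy count makes the waste obvious. This hypothetical model (my own illustration, not anything from the patch) compares writing eagerly on every re-dirty against deferring to a single write per buffer at checkpoint time:

```python
def physical_writes(dirty_events, write_on_every_touch):
    """dirty_events: sequence of buffer ids, each meaning 'this buffer
    was dirtied'.  Return how many physical writes get issued."""
    if write_on_every_touch:
        return len(dirty_events)       # eager: one write per re-dirty
    return len(set(dirty_events))      # deferred: one write per distinct buffer

# One hot buffer (id 7) dirtied five times, two cold buffers dirtied once each:
events = [7, 7, 1, 7, 2, 7, 7]
print(physical_writes(events, True), physical_writes(events, False))  # 7 3
```

Scale that hot-buffer ratio up to 20% of a large cache and the eager strategy multiplies your write traffic for no durability benefit, which is the crippling case described above.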

This contrib module seems pretty safe, patch and all.  Does anyone think
there is significant risk to slipping it into the 8.2.4 database where we
have massive public exposure on the web site handling 2 million hits per
day?

I think it's fairly safe, and my patch was pretty small; it just exposes some data that nobody had been looking at before. Think how much easier your life would have been during your earlier tuning if you had been looking at the data in these terms. Just be aware that running the query is itself intensive and causes its own tiny hiccup in throughput every time it executes, so you may want to treat this more as a snapshot you run periodically to learn about your data than as something you run very regularly.

I also think we need to somehow develop a set of tests which report maximum response time on (what should be) fast queries while the database is under different loads, so that those of us for whom reliable response time is more important than maximum overall throughput are protected from performance regressions.

My guess is that the DBT2 tests Heikki has been running are more complicated than you think they are; there are response-time guarantee requirements in there as well as the throughput numbers. The tests I run (which I haven't been publishing yet, but will be along with the final patch soon) also report worst-case and 90th-percentile latency numbers in addition to TPS. A "regression" that improved TPS at the expense of those two would not be considered an improvement by anyone involved here.

--
* Greg Smith [EMAIL PROTECTED] http://www.gregsmith.com Baltimore, MD

---------------------------(end of broadcast)---------------------------
TIP 7: You can help support the PostgreSQL project by donating at

               http://www.postgresql.org/about/donate
