Re: [HACKERS] Initial 9.2 pgbench write results

Greg Smith Tue, 14 Feb 2012 12:25:52 -0800

On 02/14/2012 01:45 PM, Greg Smith wrote:

scale=1000, db is 94% of RAM; clients=4
Version TPS
9.0  535
9.1  491 (-8.4% relative to 9.0)
9.2  338 (-31.2% relative to 9.1)

A second pass through this data noted that the maximum number of bufferscleaned by the background writer is <=2785 in 9.0/9.1, while it goes ashigh as 17345 times in 9.2. The background writer is so busy now ithits the max_clean limit around 147 times in the slower[1] of the 9.2runs. That's an average of once every 4 seconds, quite frequent.Whereas max_clean rarely happens in the comparable 9.0/9.1 results.This is starting to point my finger more toward this being an unintendedconsequence of the background writer/checkpointer split.

Thinking out loud, about solutions before the problem is even naileddown, I wonder if we should consider lowering bgwriter_lru_maxpages nowin the default config? In older versions, the page cleaning work had atmost a 50% duty cycle; it was only running when checkpoints were not.If we wanted to keep the ceiling on background writer cleaning at thesame level in the default configuration, that would require droppingbgwriter_lru_maxpages from 100 to 50. That would be roughly be the sameamount of maximum churn. It's obviously more complicated than that, butI think there's a defensible position along those lines to consider.

As a historical aside, I wonder how much this behavior might have beento blame for my failing to get spread checkpoints to show a positiveoutcome during 9.1 development. The way that was written also kept thecleaner running during checkpoints. I didn't measure those two changesindividually as much as I did the combination.

[1] I normally do 3 runs of every scale/client combination, and findthat more useful than a single run lasting 3X as long. The first out ofeach of the 3 runs I do at any scale is usually a bit faster than thelater two, presumably due to table and/or disk fragmentation. I'vetried to make this less of a factor in pgbench-tools by iteratingthrough all requested client counts first, before beginning a second runof those scale/client combination. So if the two client counts were 4and 8, it would be 4/8/4/8/4/8, which works much better than 4/4/4/8/8/8in terms of fragmentation impacting the average result. Whether itwould be better or worse to eliminate this difference by rebuilding thewhole database multiple times for each scale is complicated. I happento like seeing the results with a bit more fragmentation mixed in, seehow they compare with the fresh database. Since more rebuilds wouldalso make these tests take much longer than they already do, that's thetie-breaker that's led to the current testing schedule being thepreferred one.


--
Greg Smith   2ndQuadrant US    g...@2ndquadrant.com   Baltimore, MD
PostgreSQL Training, Services, and 24x7 Support www.2ndQuadrant.com


--
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers

Re: [HACKERS] Initial 9.2 pgbench write results

Reply via email to