Re: [HACKERS] Displaying accumulated autovacuum cost

Greg Smith Tue, 21 Feb 2012 20:55:48 -0800

I just took this for spin. Everything I tried worked, docs built andread fine. The description of how "dirty" differs from "written" is abit cryptic, but I don't see an easy way to do better without a wholenew section on that topic. Once the extension upgrade questions aresorted out, I'd say this is ready to commit. Example I have at thebottom here shows a case where this is a big improvement over theexisting tracking. I think this is a must-have improvement if we'regoing to advocate using pg_stat_statements for more things.

This works as expected in all of the EXPLAIN forms, I tried all of thesupported formats. Sample of the text one:

$ psql -d pgbench -c "EXPLAIN (ANALYZE,BUFFERS,FORMAT text) UPDATEpgbench_accounts SET aid=aid+0 WHERE aid<1000"

QUERY PLAN
----------

Update on pgbench_accounts (cost=0.00..86.09 rows=860 width=103)(actual time=8.587..8.587 rows=0 loops=1)

   Buffers: shared hit=8315 read=70 dirtied=16

-> Index Scan using pgbench_accounts_pkey on pgbench_accounts(cost=0.00..86.09 rows=860 width=103) (actual time=0.017..2.086 rows=999

 loops=1)
         Index Cond: (aid < 1000)
         Buffers: shared hit=1828 read=28
 Total runtime: 8.654 ms

Also ran just the UPDATE statement alone, then retrieved the counts frompg_stat_statements:


$ psql -x -c "select * from pg_stat_statements"

-[ RECORD 1]-------+-------------------------------------------------------------------------------------------

userid              | 10
dbid                | 16385
query               | UPDATE pgbench_accounts SET aid=aid+0 WHERE aid<1000
calls               | 1
total_time          | 0.007475
rows                | 999
shared_blks_hit     | 8370
shared_blks_read    | 15
shared_blks_dirtied | 15
shared_blks_written | 0
...

Note that there are no blocks shown as written there. That is alsodemonstrated by the results after some pgbench "-M prepared" stresstesting against a small database. The pgbench tables are structuredsuch that the number of branches < tellers << accounts. On a smallscale database (I used 10 here), there might only be a single page ofbranch data. That shows up clearly in the different amount of dirtiedblocks in each update:

$ psql -x -c "selectquery,shared_blks_hit,shared_blks_read,shared_blks_dirtied,shared_blks_writtenfrom pg_stat_statements order by calls desc limit 7"

...

query | UPDATE pgbench_branches SET bbalance = bbalance +$1 WHERE bid = $2;

shared_blks_hit     | 32929
shared_blks_read    | 0
shared_blks_dirtied | 1
shared_blks_written | 0

query | UPDATE pgbench_tellers SET tbalance = tbalance +$1 WHERE tid = $2;

shared_blks_hit     | 19074
shared_blks_read    | 0
shared_blks_dirtied | 7
shared_blks_written | 0

query | UPDATE pgbench_accounts SET abalance = abalance +$1 WHERE aid = $2;

shared_blks_hit     | 35563
shared_blks_read    | 9982
shared_blks_dirtied | 4945
shared_blks_written | 2812

Note how in the branches and tellers case, the existing "written"counter shows 0. Those hot pages stay in cache the whole time with ahigh usage count, backends never get to write them out; only thecheckpointer does. Only this new "dirtied" one reflects a useful writecount for frequently used pages like that, and it does show that morepages are being touched by pgbench_tellers than pgbench_branches.

I'd never ran into this before because I normally test against largerdatabases. But once I tried to find an example of this form, it waseasy to do so. Systems where much of the database fits intoshared_buffers in particular are likely to see a deceptively small writecount.


--
Greg Smith   2ndQuadrant US    g...@2ndquadrant.com   Baltimore, MD
PostgreSQL Training, Services, and 24x7 Support www.2ndQuadrant.com

--
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers

Re: [HACKERS] Displaying accumulated autovacuum cost

Reply via email to