Re: [HACKERS] autovacuum not prioritising for-wraparound tables

Jim Nasby Wed, 30 Jan 2013 13:06:54 -0800

On 1/25/13 11:56 AM, Christopher Browne wrote:

With a little bit of noodling around, here's a thought for a joint function
that I*think*  has reasonably common scales:


f(deadtuples, relpages, age) =
    deadtuples/relpages + e ^ (age*ln(relpages)/2^32)

Be careful with dead/relpages, because dead tuples increase relpages aswell. The effect is extremely noticeable on frequently hit tables thatneed to be kept small. If you want to have a deadtuples/size metric, Ithink it would be far better to do deadtuples/non_bloated_table_size.

Someone else in the thread mentioned that what we really need to bewatching aren't raw values, but trends. Or you can think of it aswatching first (or even second) derivatives if you like. I couldn'tagree more. I believe there are several parts of Postgres that end upwith a bunch of hard to tune GUCs specifically because we're measuringthe wrong things.

Take freezing for example. Since the only reason to freeze is XID wrapthen the *ideal* time to start a freeze vacuum on a table is so that thevacuum would end *exactly* as we were about to hit XID wrap.

Obviously that's a completely impractical goal to hit, but notice thesimplicity of the goal: we only care about the vacuum ending rightbefore we'd hit XID wrap. The only way to do that is to monitor how fastvacuums are running, how fast XIDs are being consumed, and how quicklythe oldest XID in each table is advancing. Notice that all of thosemeasurements are time derivatives.

From a more practical standpoint, I think it would be extremely usefulto have a metric that showed how quickly a table churned. Something likedead tuples per time period. Comparing that to the non-bloated tablesize should give a very strong indication of how critical frequentvacuums on that table are.

I don't have a good metric in mind for freeze right now, but I do wantto mention a use case that I don't think has come up before. Whenbuilding a londiste slave (and presumably all the other triggerreplication systems suffer from this), each table is copied over in asingle transaction, and then updates start flowing in for that table.That can easily result in a scenario where you have an enormous volumeof tuples that will all need freezing at almost exactly the same time.It would be nice if we could detect such a condition and freeze thosetuples over time, instead of trying to freeze all of them in one shot.



--
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers

Re: [HACKERS] autovacuum not prioritising for-wraparound tables

Reply via email to