Re: [HACKERS] PATCH: regular logging of checkpoint progress

Greg Smith Fri, 02 Sep 2011 23:20:01 -0700

On 09/02/2011 11:10 AM, Tomas Vondra wrote:

My 'ideal' solution would be either to add another GUC (to turn this
on/off) or allow log_checkpoints to have three values


log_checkpoints = {off, normal, detailed}

where 'normal' provides the current output and 'detail' produces this much
verbose output.

If this is going to be acceptable, that's likely the only path it couldhappen by and still meet what you're looking for. I will just againstress that the part you're working on instrumenting better right now isnot actually where larger systems really run into the most problemshere, based on what I've seen. I added a series of log messages to 9.1at DEBUG1, aimed at tracking the sync phase. That's where I see manymore checkpoint issues than in the write one. On Linux in particular,it's almost impossible for the write phase to be more of a problem thanthe sync one.

So the logging you're adding here I don't ever expect to turn on. But Iwouldn't argue against an option to handle the logging use-case you'reconcerned about. Letting people observe for themselves and decide whichof the phases is more interesting to their workload seems appropriate.Then users have options for what to log, no matter which type of problemthey run into.

If you're expanding log_checkpoints to an enum, for that to handle whatI think everybody might ever want (for what checkpoints do now atleast), I'd find that more useful if it happened like this instead:


log_checkpoints = {off, on, write, sync, verbose}

I don't think you should change the semantics of off/on, which willavoid breaking existing postgresql.conf files and resources that suggesttuning advice. "write" can toggle on what you're adding; "sync" shouldcontrol whether the DEBUG1 messages showing the individual file names inthe sync phase appear; and "verbose" can include both.

As far as a heuristic for making this less chatty when there's nothingexciting happening goes, I think something based on how much time haspassed would be the best one. In your use case, I would guess you don'treally care whether a message appears every n%. If I understand youcorrectly now, you would mainly care about getting enough log detail toknow 1) when things are running really slow, or b) often enough that themargin of error in your benchmark results from unaccounted checkpointwrites is acceptable. In both of those cases, I'd think a time-basedthreshold would be appropriate, and that also deals with the time-basedcheckpoints, too.

If your logging criteria for the write phase was "display a message anytime more than 30 seconds have passed since last seeing one", that wouldgive you only a few lines of output in a boring, normalcheckpoint--certainly less than the 9 in-progress samples you'reoutputting now, at 10% intervals. But in the pathological situationswhere writes are super slow, your log data would become correspondinglydenser, which is exactly what you want in that situation.

I think combining the two makes the most sense: "log when >=30 secondshave passed since the last message, and there's been >=10% more progressmade". (Maybe do the progress check before the time one, to cut down ongettimeofday() calls) That would give you 4 in-progress reports duringa standard 2.5 minute write phase, and in cases where the checkpointsare taking a long time you'd get as many as 9. That's pretty close toauto-tuning the amount of log output, so the amount of it is roughlyproportional to how likely the information it's logging will beinteresting.

We certainly don't want to add yet another GUC just to control thefrequency. I don't think it will be too hard to put two hard-codedthresholds in and do good enough for just about everyone though. Iwould probably prefer setting those thresholds to 60 seconds/20%instead. That might not be detailed enough for you though.


--
Greg Smith   2ndQuadrant US    g...@2ndquadrant.com   Baltimore, MD
PostgreSQL Training, Services, and 24x7 Support  www.2ndQuadrant.us


--
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers

Re: [HACKERS] PATCH: regular logging of checkpoint progress

Reply via email to