Re: [HACKERS] Design proposal: fsync absorb linear slider

Greg Smith Fri, 26 Jul 2013 06:42:14 -0700

On 7/26/13 9:14 AM, didier wrote:

During recovery you have to load the log in cache first before applying WAL.

Checkpoints exist to bound recovery time after a crash. That is their only purpose. What you're suggesting moves a lot of work into the recovery path, which will make recovery take longer.

More work at recovery time means someone who uses the default of checkpoint_timeout='5 minutes', expecting that crash recovery won't take very long, will discover it does take a longer time now. They'll be forced to shrink the value to get the same recovery time as they do currently. You might need to make checkpoint_timeout 3 minutes instead, if crash recovery now has all this extra work to deal with. And when the time between checkpoints drops, it will reduce the fundamental efficiency of checkpoint processing. You will end up writing out more data in the end.
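
As a rough illustration of that arithmetic (a toy sketch, not anything taken from the PostgreSQL source), assume worst-case recovery time scales linearly with the checkpoint interval and that the proposed change multiplies replay cost by some factor. The function name and the 5/3 factor are hypothetical, chosen only so the numbers match the 5-minute and 3-minute figures above.

```python
def timeout_for_same_recovery(current_timeout_s, recovery_cost_factor):
    """Return the checkpoint interval that keeps worst-case recovery time unchanged.

    Toy assumption: worst-case recovery time is proportional to the
    checkpoint interval, and the proposed change multiplies the cost of
    replaying that interval by recovery_cost_factor.
    """
    if recovery_cost_factor <= 0:
        raise ValueError("recovery_cost_factor must be positive")
    return current_timeout_s / recovery_cost_factor


# Hypothetical factor: recovery becomes 5/3 as expensive per second of WAL.
new_timeout_s = timeout_for_same_recovery(300, 5 / 3)
print(f"checkpoint_timeout would need to drop to about {new_timeout_s:.0f} s")
```

Real recovery time depends on much more than the interval (WAL volume, full-page writes, storage speed), so this only shows the direction of the trade-off.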

The interval between checkpoints and recovery time are all related. If you let any one side of the current requirements slip, it makes the rest easier to deal with. Those are all trade-offs though, not improvements. And this particular one is already an option.

If you want less checkpoint I/O per capita and don't care about recovery time, you don't need a code change to get it. Just make checkpoint_timeout huge. A lot of checkpoint I/O issues go away if you only do a checkpoint per hour, because instead of random writes you're getting sequential ones to the WAL. But when you crash, expect to be down for a significant chunk of an hour, as you go back to sort out all of the work postponed before.
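
To see why longer intervals reduce checkpoint writes, here is a minimal simulation under simplifying assumptions: every checkpoint writes each page dirtied since the previous checkpoint exactly once, and nothing else writes data pages in between. The workload (one hour, 100,000 updates spread over 1,000 pages) and the function name are made up for illustration.

```python
import random


def checkpoint_page_writes(updates, interval_s):
    """Count page writes when each checkpoint flushes every page dirtied
    since the previous checkpoint.

    updates: iterable of (time_s, page_id) pairs.
    """
    dirty_by_window = {}
    for time_s, page_id in updates:
        window = int(time_s // interval_s)
        dirty_by_window.setdefault(window, set()).add(page_id)
    # Repeated updates to a page within one interval collapse into one write.
    return sum(len(pages) for pages in dirty_by_window.values())


rng = random.Random(0)  # fixed seed so runs are repeatable
# Hypothetical workload: one hour of updates to a 1,000-page hot set.
updates = [(rng.uniform(0, 3600), rng.randrange(1000)) for _ in range(100_000)]

for interval_s in (180, 300, 3600):
    writes = checkpoint_page_writes(updates, interval_s)
    print(f"interval {interval_s:>4} s: {writes} page writes")
```

The hourly interval writes the fewest pages, but in this model it also leaves up to an hour of WAL to replay after a crash, which is the other side of the trade-off.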


--
Greg Smith   2ndQuadrant US    [email protected]   Baltimore, MD
PostgreSQL Training, Services, and 24x7 Support www.2ndQuadrant.com


--
Sent via pgsql-hackers mailing list ([email protected])
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers
