Re: [HACKERS] Design proposal: fsync absorb linear slider

Greg Smith Tue, 23 Jul 2013 09:15:51 -0700

On 7/23/13 10:56 AM, Robert Haas wrote:

On Mon, Jul 22, 2013 at 11:48 PM, Greg Smith <[email protected]> wrote:

We know that a 1GB relation segment can take a really long time to write
out.  That could include up to 128 changed 8K pages, and we allow all of
them to get dirty before any are forced to disk with fsync.


By my count, it can include up to 131,072 changed 8K pages.

Even better! I can pinpoint exactly what time last night I got tired enough to start making trivial mistakes. Everywhere I said 128 it's actually 131,072, which just changes the range of the GUC I proposed.
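As a quick check of the corrected figure, the arithmetic can be written out. This is only a sketch; the constant names are made up for illustration and are not PostgreSQL identifiers.

```python
# Page count for one relation segment, using the sizes from this thread.
# SEGMENT_BYTES and PAGE_BYTES are illustrative names, not PostgreSQL symbols.
SEGMENT_BYTES = 1024 ** 3   # 1GB relation segment
PAGE_BYTES = 8 * 1024       # 8K page

pages_per_segment = SEGMENT_BYTES // PAGE_BYTES
print(pages_per_segment)           # 131072
print(pages_per_segment // 1024)   # 128, i.e. 128K pages
```

So 131,072 pages is the same thing as 128K pages, which is the upper bound on dirty pages a single segment can hold.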

Getting the number right really highlights just how bad the current situation is. Would you expect the database to dump up to 128K writes into a file and then have low latency when it's flushed to disk with fsync? Of course not. But that's the job the checkpointer process is trying to do right now. And it's doing it blind--it has no idea how many dirty pages might have accumulated before it started.

I'm not exactly sure how best to use the information collected. fsync every N writes is one approach. Another is to use accumulated writes to predict how long fsync on that relation should take. Whenever I tried to spread fsync calls out before, the scale of the piled up writes from backends was the input I really wanted available. The segment write count gives an alternate way to sort the blocks too; you might start with the heaviest hit ones.
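To make those three options concrete, here is a toy model in Python. It is a sketch under loud assumptions, not a patch: SegmentWriteTracker, heaviest_first, relative_weights, and the segment names are all hypothetical, the "fsync" is just an entry appended to a list, and treating write count as proportional to fsync cost is a guess rather than a measured relationship.

```python
from collections import Counter


class SegmentWriteTracker:
    """Hypothetical per-segment write counter with an fsync-every-N policy."""

    def __init__(self, fsync_every):
        self.fsync_every = fsync_every
        self.pending = Counter()   # writes absorbed since the last fsync
        self.fsync_log = []        # stand-in for real fsync calls

    def record_write(self, segment):
        self.pending[segment] += 1
        if self.pending[segment] >= self.fsync_every:
            self.fsync_log.append(segment)   # a real system would fsync here
            self.pending[segment] = 0


def heaviest_first(write_counts):
    """Order segments by write count, heaviest first, ties broken by name."""
    return sorted(write_counts, key=lambda seg: (-write_counts[seg], seg))


def relative_weights(write_counts):
    """Share of total writes per segment, as a crude guess at relative fsync cost."""
    total = sum(write_counts.values())
    if total == 0:
        return {seg: 0.0 for seg in write_counts}
    return {seg: n / total for seg, n in write_counts.items()}


tracker = SegmentWriteTracker(fsync_every=3)
for seg in ["seg_a", "seg_a", "seg_b", "seg_a", "seg_a"]:
    tracker.record_write(seg)

counts = {"seg_a": 5, "seg_b": 9, "seg_c": 5}
print(tracker.fsync_log)           # segments that hit the N-write threshold
print(heaviest_first(counts))      # order to sync in, heaviest first
print(relative_weights(counts))    # fraction of the sync budget per segment
```

The weights could be used to divide a checkpoint's sync phase between segments, but whether write count predicts fsync time well enough for that is exactly the open question.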

In all these cases, the fundamental I keep coming back to is wanting to cue off past write statistics. If you want to predict relative I/O delay times with any hope of accuracy, you have to start the checkpoint knowing something about the backend and background writer activity since the last one.


--
Greg Smith   2ndQuadrant US    [email protected]   Baltimore, MD
PostgreSQL Training, Services, and 24x7 Support www.2ndQuadrant.com

