Re: [HACKERS] Improvement of checkpoint IO scheduler for stable transaction responses

Greg Smith Sun, 14 Jul 2013 12:14:57 -0700

On 6/27/13 11:08 AM, Robert Haas wrote:

I'm pretty sure Greg Smith tried it the fixed-sleep thing before and
it didn't work that well.

That's correct, I spent about a year whipping that particular horse andsubmitted improvements on it to the community.http://www.postgresql.org/message-id/4d4f9a3d.5070...@2ndquadrant.comand its updates downthread are good ones to compare this current workagainst.

The important thing to realize about just delaying fsync calls is thatit *cannot* increase TPS throughput. Not possible in theory, obviouslydoesn't happen in practice. The most efficient way to write things outis to delay those writes as long as possible. The longer you postpone awrite, the more elevator sorting and write combining you get out of theOS. This is why operating systems like Linux come tuned for suchdelayed writes in the first place. Throughput and latency are linked;any patch that aims to decrease latency will probably slow throughput.

Accordingly, the current behavior--no delay--is already the bestpossible throughput. If you apply a write timing change and it seems toincrease TPS, that's almost certainly because it executed lesscheckpoint writes. It's not a fair comparison. You have to adjust anydelaying to still hit the same end point on the checkpoint schedule.That's what my later submissions did, and under that sort of controlledcondition most of the improvements went away.

Now, I still do really believe that better spacing of fsync calls helpslatency in the real world. Far as I know the server that I developedthat patch for originally in 2010 is still running with that change.The result is not a throughput change though; there is a throughput dropwith a latency improvement. That is the unbreakable trade-off in thisarea if all you touch is scheduling.

The reason why I was ignoring this discussion and working on pgbenchthrottling until now is that you need to measure latency at a constantthroughput to advance here on this topic, and that's exactly what thenew pgbench feature enables. If we can take the current checkpointscheduler and an altered one, run both at exactly the same rate, and onegives lower latency, now we're onto something. It's possible to do thatwith DBT-2 as well, but I wanted something really simple that peoplecould replicate results with in pgbench.


--
Greg Smith   2ndQuadrant US    g...@2ndquadrant.com   Baltimore, MD
PostgreSQL Training, Services, and 24x7 Support www.2ndQuadrant.com


--
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers

Re: [HACKERS] Improvement of checkpoint IO scheduler for stable transaction responses

Reply via email to