Re: [HACKERS] Sorting writes during checkpoint

Greg Smith Tue, 15 Jul 2008 22:56:46 -0700

On Mon, 7 Jul 2008, ITAGAKI Takahiro wrote:

I will have a plan to test it on RAID-5 disks, where sequential writing
are much better than random writing. I'll send the result as an evidence.

If you're running more tests here, please turn on log_checkpoints andcollect the logs while the test is running. I'm really curious if there'sany significant difference in what that reports here in the sorted casevs. the regular one.

Smoothed checkpoint in 8.3 spreads write(), but calls fsync() at once.With sorted writes, we can call fsync() segment-by-segment for eachwrites of dirty pages contained in the segment. It could improve worstresponse time during checkpoints.

Further decreasing the amount of data that is fsync'd at any point in timemight be a bigger improvement than just the sorting itself is doing (sofar I haven't seen anything really significant just from the sort but amstill testing).

One thing I didn't see any comments from you on is how/if the sortedwrites patch lowers worst-case latency. That's the area I'd hope animproved fsync protocol would help most with, rather than TPS, which mighteven go backwards because writes won't be as bunched and therefore willhave more seeking. It's easy enough to analyze the data coming from"pgbench -l" to figure that out; example shell snipped that shows just theworst ones:


pgbench -l -N <db>
p=$!
wait $p
mv pgbench_log.${p} pgbench.log
cat pgbench.log | cut -f 3 -d " " | sort -n | tail

Actually graphing the latencies can be even more instructive, I have someexamples of that on my web page you may have seen before.

In addition, the current smgr layer is completely useless because
it cannot be extended dynamically and cannot handle multiple md-layer
modules. I would rather merge current smgr and part of bufmgr into
a new smgr and add smgr_hook() than bulk_io_hook().

I don't really have a firm opinion here about the code to comment on thisspecific suggestion, but I will say that I've found the amount of layeringin this area makes it difficult to understand just what's going onsometimes (especially when new to it). A lot of that abstraction felt abit pass-through to me, and anything that would collapse that a bit wouldbe helpful for streamlining the code instrumenting going on with thingslike dtrace.


--
* Greg Smith [EMAIL PROTECTED] http://www.gregsmith.com Baltimore, MD

--
Sent via pgsql-hackers mailing list ([email protected])
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers

Re: [HACKERS] Sorting writes during checkpoint

Reply via email to