Re: [PATCHES] configure option for XLOG_BLCKSZ

Greg Smith Fri, 02 May 2008 09:29:47 -0700

On Fri, 2 May 2008, Tom Lane wrote:

The case for varying BLCKSZ is marginal already, and I've seen none atall for varying XLOG_BLCKSZ.

I recall someone on the performance list who felt it useful increaseXLOG_BLCKSZ to support a high-write environment with WAL shipping, just tomake sending the files over the network more efficient. Can't seem tofind a reference in the archives though.

If you look at things like the giant Sun system tests, there wassignificant tuning getting all the block sizes to line up better with theunderlying hardware. I would not be surprised to discover that sort ofinstall gains a bit from slinging WAL files around in larger chunks aswell. They're already using small values for commit_delay just to get thetypical WAL write to be in larger blocks.

As PostgreSQL makes it way into higher throughput environments, itwouldn't surprise me to discover more of these situations where switchingWAL segments every 16MB turns into a bottleneck. Right now, it may onlybe a few people in the world, but saying "that's big enough" for anallocation of anything usually turns out wrong if you wait long enough.

One real concern I have with making this easier to adjust is that I'd hateto let people pick any old block size with the default wal_sync_method,only to have them later discover they can't turn on any direct I/O writemethod because they botched the alignment restrictions.

Another issue though is whether it makes sense for XLOG_BLCKSZ to bedifferent from BLCKSZ at all, at least in the default case. They areboth the unit of I/O and it's not clear why you'd want different units.

There are lots of people who use completely different physical or logicaldisk setups for the WAL disk than the regular database. That's going toget even more varied moving forward as SSD starts getting used more, sincethose devices have a very different set of block size optimizationcharacteristics compared with traditional RAID setups. They prefersmaller blocks to match the underlying flash better, and you don't pay asmuch of a penalty for writing that way because lining up with the spinningdisk isn't important. Someone who put one of DB/WAL on SSD and the otheron traditional disk might end up with very different DB/WAL block sizes tomatch.


--
* Greg Smith [EMAIL PROTECTED] http://www.gregsmith.com Baltimore, MD

--
Sent via pgsql-patches mailing list ([email protected])
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-patches

Re: [PATCHES] configure option for XLOG_BLCKSZ

Reply via email to