Re: Scaling XLog insertion (was Re: [HACKERS] Moving more work outside WALInsertLock)

Heikki Linnakangas Mon, 05 Mar 2012 00:32:42 -0800

On 20.02.2012 08:00, Amit Kapila wrote:

I was trying to understand this patch and had few doubts:


1. In PerformXLogInsert(), why there is need to check freespace when
already during ReserveXLogInsertLocation(), the space is reserved. Is
it possible that the record size is more than actually calculted in
ReserveXLogInsertLocation(), if so in that case what I understand is
it is moving to next page to write, however isn't it possible that
some other backend had already reserved that space.

The calculations between PerformXLogInsert (called CopyXLogRecordToWAL()in the latest patch version) and ReserveXLogInsertLocation() must alwaysmatch, otherwise we have reserved incorrect amount of WAL and you getcorrupt WAL. They both need to do the same calculations of how the WALrecord is split across pages, which depends on how much free space thereis on the first page. There is an assertion in CopyXLogRecordToWAL() tocheck that once it's finished writing the WAL record, the last bytelanded on the position that ReserveXLogInsertLocation() calculated itwould.

Another way to do that would be to remember the calculations done inReserveXLogInsertLocation(), in an extra array or something. But we wantto keep ReserveXLogInsertLocation() as simple as possible, as that runswhile holding the spinlock. Any extra CPU cycles there will hurtscalability.

2. In function WaitForXLogInsertionSlotToBecomeFree(), chances are
there such that when nextslot equals lastslot, all new backends try
to  reserve a slot will start waiting on same last slot which can
lead to serialization for those backends and can impact latency.

True. That warrants some performance testing to see if that effect issignificant. (it's surely better than the current situation, anyway,where all WAL insertions block on the single lock)

3. GetXlogBuffer - This will get called twice, once for normal
buffer, second time for when there is not enough space in current
page, and both times it can lead to I/O whereas in earlier algorithm,
the chances of I/O is only once.

I don't see any difference to the previous situation. In both cases, ifyou need a new page to copy the WAL record to, you need to first flushout some old pages from the WAL buffers if they're all dirty. The patchdoesn't change the number of WAL buffers consumed. Note thatGetXLogBuffer() is very cheap when it doesn't need to do I/O, extracalls to it don't matter if the page is already initialized.


--
  Heikki Linnakangas
  EnterpriseDB   http://www.enterprisedb.com

--
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers

Re: Scaling XLog insertion (was Re: [HACKERS] Moving more work outside WALInsertLock)

Reply via email to