Re: Buffer locking is special (hints, checksums, AIO writes)

Heikki Linnakangas Sat, 07 Feb 2026 02:44:52 -0800

On 03/02/2026 00:33, Andres Freund wrote:

- The way MarkBufferDirtyHint() operates was copied into
   heap_inplace_update_and_unlock(). Now that MarkBufferDirtyHint() won't work
   that way anymore, it seems better to go with the alternative approach the
   comments already outlined, namely to only delay updating of the buffer
   contents.


   I've done this in a prequisite commit, as it doesn't actually depend on any
   of the other changes.  Noah, any chance you could take a look at this?


Patch 0001 Looks correct to me. However:

         * ["D" is a VACUUM (ONLY_DATABASE_STATS)]
         * ["R" is a VACUUM tbl]
         * D: vac_update_datfrozenxid() -> systable_beginscan(pg_class)
         * D: systable_getnext() returns pg_class tuple of tbl
         * R: memcpy() into pg_class tuple of tbl
         * D: raise pg_database.datfrozenxid, XLogInsert(), finish
         * [crash]
         * [recovery restores datfrozenxid w/o relfrozenxid]
         *
         * As we hold an exclusive lock - preventing the buffer from being 
written
         * out once dirty - we can work around this as follows: 
MarkBufferDirty(),
         * XLogInsert(), memcpy().

That last reference to 'memcpy' is a little orphaned now. The commentused to talk about the stack copy of the page, but now there's nomention of that except for this reference to memcpy(). To make thingsworse, the steps have "memcpy() into pg_class tuple of tbl", so onecould think that the "memcpy" refers to that.


How about this:

         * We avoid that by using a temporary copy of the buffer to hide our
         * change from other backends until it's been WAL-logged. We apply our
         * change to the temporary copy and WAL-log it before modifying the real
         * page. That way any action a reader of the in-place-updated value 
takes
         * will be WAL logged after this change.

- Heikki

Re: Buffer locking is special (hints, checksums, AIO writes)

Reply via email to