[HACKERS] Checkpoint gets stuck in mdsync

Heikki Linnakangas Thu, 05 Apr 2007 04:46:48 -0700

Now that the CheckpointStartLock starvation has been taken care of, I'mseeing another problem with checkpoints in my test run: mdsync neverfinishes.


Here's what's happening:
1. checkpoint calls mdsync
2. mdsync start processing pending fsyncs from pendingOpsTable

(at this point, normal backends have to start doing writes themselves,because bgwriter is busy checkpointing and isn't keeping buffers clean)

3. after fsyncing 10 files, it calls AbsorbFsyncRequests

4. AbsorbFsyncRequests puts back entries into pendingOpsTable for thosefiles that were already fsynced.

5. mdsync starts over, goto 2.

The loop doesn't end until the test run is over, mdsync keeps fsyncingthe same over and over again.

My proposed fix is to make a copy of pendingOpsTable before entering theloop. AbsorbFsyncRequest will put new requests to a fresh newpendingOpsTable, while the mdsync loop will drain the copy. I'll write apatch along those lines if there's no better ideas.


--
  Heikki Linnakangas
  EnterpriseDB   http://www.enterprisedb.com

---------------------------(end of broadcast)---------------------------
TIP 3: Have you checked our extensive FAQ?

              http://www.postgresql.org/docs/faq

[HACKERS] Checkpoint gets stuck in mdsync

Reply via email to