Re: [HACKERS] Checkpoint gets stuck in mdsync

Heikki Linnakangas Thu, 05 Apr 2007 04:47:35 -0700

ITAGAKI Takahiro wrote:

Heikki Linnakangas <[EMAIL PROTECTED]> wrote:
Now that the CheckpointStartLock starvation has been taken care of, I'm seeing another problem with checkpoints in my test run: mdsync never finishes.
My proposed fix is to make a copy of pendingOpsTable before entering the loop. AbsorbFsyncRequest will put new requests into a fresh pendingOpsTable, while the mdsync loop drains the copy. I'll write a patch along those lines if there are no better ideas.
Yeah, I'm also concerned about the checkpoint getting stuck. I wrote a fix that
uses a copy of pendingOpsTable, as you said, when I implemented the Load distributed
checkpoint patch. (http://momjian.us/mhonarc/patches/msg00025.html) I'd be
very happy if you could review my patch and check whether my fix is correct.

I just posted a patch to pgsql-patches that fixes the issue along the lines of your Load distributed checkpoint patch. The Load distributed checkpoint patch now just needs to add the "calculate total file length" step and the nap delay to mdsync.
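
For anyone following along, here is a rough toy model of the idea in Python. It is not the patch and not PostgreSQL code: the class PendingOps, the method names, and the file names are made up for illustration, and a plain set stands in for pendingOpsTable. It only shows why draining a detached copy lets the loop terminate even when new fsync requests keep arriving.

```python
class PendingOps:
    """Toy stand-in for pendingOpsTable: names of files awaiting fsync."""

    def __init__(self):
        self.table = set()

    def absorb(self, name):
        # Hypothetical analogue of AbsorbFsyncRequest: record a new request.
        self.table.add(name)

    def sync_all(self, fsync):
        # Detach the current table and install a fresh one, so requests
        # absorbed while we are syncing land in the new table and are
        # left for the next checkpoint.
        snapshot, self.table = self.table, set()
        synced = []
        for name in sorted(snapshot):
            fsync(name)
            synced.append(name)
        return synced


ops = PendingOps()
ops.absorb("rel_1")
ops.absorb("rel_2")


def fake_fsync(name):
    # Simulate backends dirtying more files while the sync loop runs.
    ops.absorb(name + "_again")


synced = ops.sync_all(fake_fsync)
```

After the call, synced holds only the two requests that were pending when the loop started, and the requests that arrived during the loop sit in the fresh table. The sketch ignores fsync failures, where the real code would presumably have to keep the unsynced entries around.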


Thanks for the patch!

--
  Heikki Linnakangas
  EnterpriseDB   http://www.enterprisedb.com

