Tom Lane wrote:
> Heikki Linnakangas <[EMAIL PROTECTED]> writes:
>> I'm thinking of keeping an in-memory mapping of old and new tids of updated tuples while clustering, instead. That means that cluster requires a little bit of memory for each RECENTLY_DEAD updated tuple. In the worst case that means that you run out of memory if there's too many of those in the table, but I doubt that's going to be a problem in practice.
>
> That is more or less isomorphic to what VACUUM FULL does.  While people
> have complained about VACUUM FULL's memory usage on occasion, just at
> the moment I feel that the main problem with it is complexity.  If we
> still haven't gotten all the bugs out of VACUUM FULL after more than
> eight years of work on it, what are the odds that we can make CLUSTER
> do it right the first time?

Well, I can't guarantee that there are no bugs.

To copy a chain correctly, we need to correctly detect tuples whose t_ctid points to a non-dead tuple (non-dead meaning HeapTupleSatisfiesVacuum(tuple) != DEAD), and tuples that are pointed to by a non-dead tuple. If we incorrectly decide that a tuple belongs to one of those categories when in fact it doesn't, we don't corrupt anything; we just waste a little bit of memory remembering the tuple unnecessarily.
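
Roughly, the "non-dead" test is just this (the exact HeapTupleSatisfiesVacuum argument list has varied between versions, so take the call as schematic rather than as real patch code):

static bool
tuple_is_nondead(HeapTupleHeader tuple, TransactionId OldestXmin, Buffer buf)
{
    /* for chain-following purposes, anything other than DEAD counts as non-dead */
    return HeapTupleSatisfiesVacuum(tuple, OldestXmin, buf) != HEAPTUPLE_DEAD;
}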

To detect tuples in the first category, we need to check that xmax of the tuple isn't invalid, and t_ctid doesn't point to itself.
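
As a sketch, using the usual heapam macros (illustrative only, not code from an actual patch):

static bool
points_to_newer_version(HeapTuple tuple)
{
    /* xmax must be valid; if it's invalid the tuple was never updated */
    if (tuple->t_data->t_infomask & HEAP_XMAX_INVALID)
        return false;

    /* and t_ctid must point elsewhere, namely to the newer version */
    if (ItemPointerEquals(&tuple->t_self, &tuple->t_data->t_ctid))
        return false;

    return true;
}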

To detect tuples in the second category, we need to check that xmin isn't invalid, and is greater than OldestXmin.
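
Again as an illustrative sketch, with TransactionIdFollows as the wraparound-aware "greater than":

static bool
could_be_chain_successor(HeapTuple tuple, TransactionId OldestXmin)
{
    TransactionId xmin = HeapTupleHeaderGetXmin(tuple->t_data);

    if (!TransactionIdIsValid(xmin))
        return false;

    /* inserted after OldestXmin, so an older, non-dead version may point here */
    return TransactionIdFollows(xmin, OldestXmin);
}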

With both categories correctly identified, it's just a matter of mapping old ctids to corresponding tids in the new heap.
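
The map itself could be as simple as a hash table keyed by old TID; the struct below is only meant to show the idea, and the names are made up:

typedef struct TidMapEntry
{
    ItemPointerData old_tid;    /* TID the tuple had in the old heap */
    ItemPointerData new_tid;    /* where its copy was placed in the new heap */
} TidMapEntry;

When a tuple in the second category is copied, we enter its old TID -> new TID into the map; when a tuple in the first category is copied, we look up the old TID its t_ctid points to and set the copied tuple's t_ctid to the successor's new location. (What to do when the successor hasn't been copied yet is glossed over here.)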

Unlike with my first proposal, if something nevertheless goes wrong in detecting the chains, we only lose the chaining between the tuples; we don't otherwise lose any data, and the latest version of each row is fine in any case. I think this approach is pretty robust, and it fails in a benign way.

--
  Heikki Linnakangas
  EnterpriseDB   http://www.enterprisedb.com
