Re: [HACKERS] autovacuum_work_mem

Heikki Linnakangas Mon, 16 Dec 2013 02:13:57 -0800

On 12/13/2013 08:40 PM, Alvaro Herrera wrote:

Heikki Linnakangas wrote:

I haven't been following this thread in detail, but would it help if
we implemented a scheme to reduce (auto)vacuum's memory usage? Such
schemes have been discussed in the past, packing the list of dead
items more tightly.


Well, it would help some, but it wouldn't eliminate the problem
completely.  Autovacuum scales its memory usage based on the size of the
table.  There will always be a table so gigantic that a maximum
allocated memory is to be expected; and DBAs will need a way to limit
the memory consumption even if we pack dead items more densely.

I was playing with keeping item pointers for each page in a bitmapset.
This was pretty neat and used a lot less memory than currently, except
that I needed to allocate a large chunk of memory and then have
bitmapsets use words within that large allocation space.  It turned out
to be too ugly so I abandoned it.  With the "varbit encoding" thingy in
the recent GIN patchset, maybe it would be workable.

The varbyte encoding is actually a very poor fit for vacuum. Vacuum
needs fast random access into the array when scanning indexes, and the
varbyte encoded item pointer lists used in gin don't allow that.

I couldn't find it in the archives now, but when we last discussed this,
Tom suggested that we divide the large chunk of memory that vacuum
allocates into two parts. The first part grows from the bottom up, and
the second part from top down, until there is no free space in the
middle anymore. For each heap page, there is one entry in the first
part, with the block number, and a pointer to an entry in the second
part. In the second part, there's a list of offset numbers on that page
(or a bitmap).
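
To make the two-part layout concrete, here is a rough sketch in C. It is
only an illustration under my own assumptions, not code from any patch:
the names (PageEntry, DeadTupleArena, arena_add_page, arena_contains) are
made up, the "pointer" into the second part is stored as an index, pages
are assumed to be added in increasing block order, and the second part
holds plain offset lists rather than bitmaps.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>
#include <stdlib.h>
#include <string.h>

/* Hypothetical entry in the first (bottom-up) part: one per heap page. */
typedef struct {
    uint32_t block;   /* heap block number */
    uint32_t first;   /* index, in uint16_t units from mem, of this page's offsets */
    uint16_t count;   /* number of dead offsets recorded for this page */
} PageEntry;

/* One allocation shared by both parts (hypothetical layout). */
typedef struct {
    unsigned char *mem;
    size_t size;      /* total bytes, kept even so the top part stays aligned */
    size_t npages;    /* PageEntry slots used at the bottom */
    size_t noffsets;  /* uint16_t slots used at the top */
} DeadTupleArena;

static bool arena_init(DeadTupleArena *a, size_t size)
{
    a->size = size & ~(size_t) 1;
    a->mem = malloc(a->size);
    a->npages = 0;
    a->noffsets = 0;
    return a->mem != NULL;
}

/* Record one page's dead offsets; returns false once the two parts would meet. */
static bool arena_add_page(DeadTupleArena *a, uint32_t block,
                           const uint16_t *offs, uint16_t n)
{
    size_t bottom = (a->npages + 1) * sizeof(PageEntry);
    size_t top = (a->noffsets + n) * sizeof(uint16_t);

    if (bottom + top > a->size)
        return false;               /* no free space left in the middle */

    /* The second part grows downward from the end of the allocation. */
    a->noffsets += n;
    uint32_t first = (uint32_t) (a->size / sizeof(uint16_t) - a->noffsets);
    memcpy(a->mem + (size_t) first * sizeof(uint16_t), offs,
           n * sizeof(uint16_t));

    /* The first part grows upward from the start. */
    PageEntry *pages = (PageEntry *) a->mem;
    pages[a->npages++] = (PageEntry) { block, first, n };
    return true;
}

/* Binary search the page entries, then scan that page's offset list. */
static bool arena_contains(const DeadTupleArena *a, uint32_t block,
                           uint16_t offset)
{
    const PageEntry *pages = (const PageEntry *) a->mem;
    size_t lo = 0, hi = a->npages;

    while (lo < hi) {
        size_t mid = lo + (hi - lo) / 2;

        if (pages[mid].block < block)
            lo = mid + 1;
        else if (pages[mid].block > block)
            hi = mid;
        else {
            const uint16_t *offs = (const uint16_t *)
                (a->mem + (size_t) pages[mid].first * sizeof(uint16_t));

            for (uint16_t i = 0; i < pages[mid].count; i++)
                if (offs[i] == offset)
                    return true;
            return false;
        }
    }
    return false;
}
```

When arena_add_page returns false, the caller would presumably have to
process what has been collected so far before continuing, much as vacuum
does today when its dead-tuple array fills up.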

Another idea: Store only the least significant 20 bits of the block
number of each item pointer, and use the remaining 12 bits for the
offset number. So each item pointer is stored as a single 32 bit
integer. For the top 12 bits of the block number, build a separate
lookup table of 4096 entries, indexed by the top bits. Each entry in
the lookup table points to the beginning and end index in the main
array where the entries for that page range are stored. That would
reduce the memory usage by about 1/3, which isn't as good as the bitmap
method when there are a lot of dead tuples on the same pages, but would
probably be a smaller patch.
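
A minimal sketch of that packing in C, again with made-up names
(DeadTupleIndex, pack_tid, build_index, tid_is_dead) and under the
assumptions that the input is already sorted by (block, offset) and that
every offset number fits in 12 bits. The 1/3 figure comes from going from
a 6-byte item pointer to a 4-byte integer; the lookup table adds a fixed
4096 entries on top.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

#define LOW_BLOCK_BITS 20
#define OFFSET_BITS    12
#define NUM_RANGES     4096    /* one entry per value of the top 12 block bits */

/* Half-open slice [begin, end) of the main array for one page range. */
typedef struct {
    size_t begin;
    size_t end;
} RangeEntry;

typedef struct {
    const uint32_t *packed;          /* main array, one 32-bit word per dead tuple */
    RangeEntry ranges[NUM_RANGES];   /* lookup table indexed by the top block bits */
} DeadTupleIndex;

/* Low 20 bits of the block number in the high bits, offset in the low 12. */
static uint32_t pack_tid(uint32_t block, uint16_t offset)
{
    assert(offset < (1u << OFFSET_BITS));
    return ((block & ((1u << LOW_BLOCK_BITS) - 1)) << OFFSET_BITS) | offset;
}

/* Fill packed[] and the lookup table from n sorted (block, offset) pairs. */
static void build_index(DeadTupleIndex *idx, const uint32_t *blocks,
                        const uint16_t *offsets, uint32_t *packed, size_t n)
{
    for (size_t r = 0; r < NUM_RANGES; r++)
        idx->ranges[r].begin = idx->ranges[r].end = 0;

    for (size_t i = 0; i < n; i++) {
        RangeEntry *r = &idx->ranges[blocks[i] >> LOW_BLOCK_BITS];

        packed[i] = pack_tid(blocks[i], offsets[i]);
        if (r->begin == r->end)
            r->begin = i;            /* first entry seen for this page range */
        r->end = i + 1;
    }
    idx->packed = packed;
}

/* Random access: pick the slice by the top bits, then binary search it. */
static bool tid_is_dead(const DeadTupleIndex *idx, uint32_t block,
                        uint16_t offset)
{
    const RangeEntry *r = &idx->ranges[block >> LOW_BLOCK_BITS];
    uint32_t key = pack_tid(block, offset);
    size_t lo = r->begin, hi = r->end;

    while (lo < hi) {
        size_t mid = lo + (hi - lo) / 2;

        if (idx->packed[mid] < key)
            lo = mid + 1;
        else if (idx->packed[mid] > key)
            hi = mid;
        else
            return true;
    }
    return false;
}
```

Within one page range the top bits are identical, so the packed words
sort in the same order as the original item pointers and a plain binary
search still works.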


- Heikki

