Re: [HACKERS] Seq scans roadmap

Heikki Linnakangas Thu, 10 May 2007 04:36:22 -0700

Zeugswetter Andreas ADI SD wrote:

Also, that patch doesn't address the VACUUM issue at all. Andusing a small fixed size ring with scans that do updates canbe devastating. I'm experimenting with different ring sizesfor COPY at the moment. Too small ring leads to a lot of WALflushes, it's basically the same problem we have with VACUUMin CVS HEAD.
My first take on that would be to simply abandon any dirty (and actually
also any still pinned) buffer from the ring and replace the ring slot
with a buffer from the freelist.
If the freelist is empty and LSN allows writing the buffer, write it
(and maybe try to group these).
If the LSN does not allow the write, replace the slot with a buffer from
LRU.

That would effectively disable the ring for COPY and the 2nd phase ofVACUUM.

One problem with looking at the LSN is that you need the content lock toread it, and I wouldn't want to add any new locking. It could be doneinside FlushBuffer when we hold the lock anyway, but I'm afraid thechanges would be pretty invasive.

I'm struggling to get a grip of what the optimal ring size is undervarious circumstances. Some thoughts I have this far:

- a small ring gives better L2 cache behavior

- for read-only queries, and for queries that just hint bits, 1 bufferis enough- small ring with query that writes WAL (COPY, mass updates, 2nd phaseof VACUUM) leads to a lot of WAL flushes, which can become bottleneck.

But all these assumptions need to be validated. I'm setting up testswith different ring sizes and queries to get a clear picture of this:

- VACUUM on a clean table
- VACUUM on a table with 1 dead tuple per page
- read-only scan, large table
- read-only scan, table fits in OS cache
- COPY

In addition, I'm going to run VACUUM in a DBT-2 test to see the affecton other queries running concurrently.

I think a ring that grows when WAL flushes occur covers all the usecases reasonably well, but I need to do the testing...


--
  Heikki Linnakangas
  EnterpriseDB   http://www.enterprisedb.com

---------------------------(end of broadcast)---------------------------
TIP 2: Don't 'kill -9' the postmaster

Re: [HACKERS] Seq scans roadmap

Reply via email to