Re: [PATCHES] Seq scans status update

Heikki Linnakangas Fri, 18 May 2007 02:09:36 -0700

Tom Lane wrote:

Heikki Linnakangas <[EMAIL PROTECTED]> writes:
In any case, I do want this for VACUUMs to fix the "WAL flush for everydirty page" problem. Maybe we should indeed drop the other aspects ofthe patch and move on, I'm getting tired of this as well.
Can we devise a small patch that fixes that issue without creating
uncertain impacts on other behavior?


Yeah certainly, that's my backup plan.

The thing that has made me uncomfortable with this set of patches right
along is the presumption that it's a good idea to tweak cache behavior
to improve a small set of cases.  We are using cache behavior (now
clock-sweep, formerly LRU) that is well tested and proven across a large
body of experience; my gut feeling is that deviating from that behavior
is likely to be a net loss when the big picture is considered.  I
certainly have not seen any test results that try to prove that there's
no such generalized disadvantage to these patches.

That was my initial reaction as well. However, people claimed that usinga small number of buffers you can achieve higher raw throughput.Avoiding the cache spoiling sounds plausible as well, if you're going todo a seq scan of a table larger than shared_buffers, you know thosepages are not going to be needed in the near future; you're going toreplace them yourself with pages from the same scan.

One could argue that the real bug here is that VACUUM deviates from the
standard behavior by forcing immediate recycling of pages it releases,
and that getting rid of that wart is the correct answer rather than
piling more warts atop it --- especially warts that will change the
behavior for anything besides VACUUM.

Maybe a better approach would be switching to an algorithm that's moreresistant to sequential scans by nature. That's definitely not going tohappen for 8.3, though.

It's also possible that whatever algorithm we choose doesn't make muchdifference in practice because the in typical configurations the OScache is bigger and more significant anyway. It's also possible thatthere's some counter-productive interactions between our and OS cachemanagement.

In any case, I'd like to see more test results before we make adecision. I'm running tests with DBT-2 and a seq scan running in thebackground to see if the cache-spoiling effect shows up. I'm also tryingto get hold of some bigger hardware to run on. Running these tests takessome calendar time, but the hard work has already been done. I'm goingto start reviewing Jeff's synchronized scans patch now.


--
  Heikki Linnakangas
  EnterpriseDB   http://www.enterprisedb.com

---------------------------(end of broadcast)---------------------------
TIP 5: don't forget to increase your free space map settings

Re: [PATCHES] Seq scans status update

Reply via email to