Re: [HACKERS] Bug: Buffer cache is not scan resistant

Jim Nasby Mon, 05 Mar 2007 22:54:01 -0800

On Mar 5, 2007, at 11:46 AM, Josh Berkus wrote:

Tom,
I seem to recall that we've previously discussed the idea ofletting the
clock sweep decrement the usage_count before testing for 0, so that a
buffer could be reused on the first sweep after it was initiallyused,
but that we rejected it as being a bad idea.  But at least with large
shared_buffers it doesn't sound like such a bad idea.
We did discuss an number of formulas for setting buffers withdifferentclock-sweep numbers, including ones with higher usage_count forindexes andstarting numbers of 0 for large seq scans as well as vacuums.However, wedidn't have any way to prove that any of these complex algorithmswouldresult in higher performance, so went with the simplest formula,with theidea of tinkering with it when we had more data. So maybe now'sthe time.
Note, though, that the current algorithm is working very, very wellfor OLTPbenchmarks, so we'd want to be careful not to gain performance inone area at
the expense of another.  In TPCE testing, we've been able to increase
shared_buffers to 10GB with beneficial performance effect (numbersposted
when I have them) and even found that "taking over RAM" with the
shared_buffers (ala Oracle) gave us equivalent performance to usingthe FScache. (yes, this means with a little I/O management engineeringwe couldcontemplate discarding use of the FS cache for a net performancegain. Maybe
for 8.4)

An idea I've been thinking about would be to have the bgwriter orsome other background process actually try and keep the free listpopulated, so that backends needing to grab a page would be much morelikely to find one there (and not have to wait to scan through theentire buffer pool, perhaps multiple times).

My thought is to keep track of how many page requests occurred duringa given interval, and use that value (probably averaged over time) todetermine how many pages we'd like to see on the free list. Thebackground process would then run through the buffers decrementingusage counts until it found enough for the free list. Before puttinga buffer on the 'free list', it would write the buffer out; I'm notsure if it would make sense to de-associate the buffer with whateverit had been storing or not, though. If we don't do that, that wouldmean that we could pull pages back off the free list if we wanted to.That would be helpful if the background process got a bit over-zealous.

--
Jim Nasby                                            [EMAIL PROTECTED]
EnterpriseDB      http://enterprisedb.com      512.569.9461 (cell)



---------------------------(end of broadcast)---------------------------
TIP 9: In versions below 8.0, the planner will ignore your desire to
      choose an index scan if your joining column's datatypes do not
      match

Re: [HACKERS] Bug: Buffer cache is not scan resistant

Reply via email to