Re: [HACKERS] Bug: Buffer cache is not scan resistant

Jeff Davis Tue, 06 Mar 2007 10:00:23 -0800

On Mon, 2007-03-05 at 21:02 -0700, Jim Nasby wrote:
> On Mar 5, 2007, at 2:03 PM, Heikki Linnakangas wrote:
> > Another approach I proposed back in December is to not have a  
> > variable like that at all, but scan the buffer cache for pages  
> > belonging to the table you're scanning to initialize the scan.  
> > Scanning all the BufferDescs is a fairly CPU and lock heavy  
> > operation, but it might be ok given that we're talking about large  
> > I/O bound sequential scans. It would require no DBA tuning and  
> > would work more robustly in varying conditions. I'm not sure where  
> > you would continue after scanning the in-cache pages. At the  
> > highest in-cache block number, perhaps.
> 
> If there was some way to do that, it'd be what I'd vote for.
>


I still don't know how to make this take advantage of the OS buffer
cache. 

However, no DBA tuning is a huge advantage, I agree with that.

If I were to implement this idea, I think Heikki's bitmap of pages
already read is the way to go. Can you guys give me some pointers about
how to walk through the shared buffers, reading the pages that I need,
while being sure not to read a page that's been evicted, and also not
potentially causing a performance regression somewhere else?

> Given the partitioning of the buffer lock that Tom did it might not  
> be that horrible for many cases, either, since you'd only need to  
> scan through one partition.
> 
> We also don't need an exact count, either. Perhaps there's some way  
> we could keep a counter or something...

Exact count of what? The pages already in cache?

Regards,
        Jeff Davis


---------------------------(end of broadcast)---------------------------
TIP 9: In versions below 8.0, the planner will ignore your desire to
       choose an index scan if your joining column's datatypes do not
       match

Re: [HACKERS] Bug: Buffer cache is not scan resistant

Reply via email to