Re: [HACKERS] Scaling shared buffer eviction

Gregory Smith Tue, 23 Sep 2014 21:03:03 -0700

On 9/23/14, 7:13 PM, Robert Haas wrote:

> I think we expose far too little information in our system views. Just to take one example, we expose no useful information about lwlock acquire or release, but a lot of real-world performance problems are caused by lwlock contention.

I sent over a proposal for what I was calling Performance Events about a year ago. The idea was to provide a place to save data about lock contention, weird checkpoint sync events, that sort of thing. Replacing log parsing to get at log_lock_waits data was my top priority. Once that's there, lwlocks were an obvious next target. Presumably we just needed collection to be low enough overhead, and then we can go down to whatever shorter locks we want; the lower the overhead, the faster the event we can measure.

Sometimes the database will never be able to instrument some of its fastest events without blowing away the event itself. We'll still have perf / dtrace / systemtap / etc. for those jobs. But those are not the problems of the average Postgres DBA's typical day.

The data people need to solve this sort of thing in production can't always show up in counters. You'll get evidence the problem is there, but you need more details to actually find the culprit. Some info about the type of lock, tables and processes involved, maybe the query that's running, that sort of thing. You can kind of half-ass the job if you make per-table counters for everything, but we really need more, both to serve our users and to compare well against what other databases provide for tools. That's why I was trying to get the infrastructure to capture all that lock detail, without going through the existing logging system first.

Actually building Performance Events fell apart on the storage side: figuring out where to put it all without waiting for a log file to hit disk. I wanted in-memory storage so clients don't wait for anything, then a potentially lossy persistence writer. I thought I could get away with a fixed size buffer like pg_stat_statements uses. That was optimistic. Trying to do better got me lost in memory management land without making much progress.
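
To make the "fixed size buffer plus lossy persistence writer" shape concrete, here is a minimal sketch in C. It is not code from the Performance Events proposal or from PostgreSQL; every name in it (PerfEvent, PerfEventRing, perf_ring_record, perf_ring_drain, PERF_EVENT_SLOTS) is hypothetical, and it assumes a single process with no locking. A real version would live in shared memory and need lwlocks or atomics around the counters.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical slot count; kept tiny so wraparound is easy to see. */
#define PERF_EVENT_SLOTS 4

/* Hypothetical event record; the fields are illustrative only. */
typedef struct PerfEvent
{
    uint64_t    seq;        /* sequence number assigned when recorded */
    int         pid;        /* process that hit the event */
    int         wait_ms;    /* how long it waited */
    char        kind[16];   /* short label, e.g. "lock_wait" */
} PerfEvent;

typedef struct PerfEventRing
{
    uint64_t    next_seq;   /* total events ever recorded */
    uint64_t    next_read;  /* next sequence number the writer will persist */
    uint64_t    dropped;    /* events overwritten before being persisted */
    PerfEvent   slots[PERF_EVENT_SLOTS];
} PerfEventRing;

static void
perf_ring_init(PerfEventRing *r)
{
    memset(r, 0, sizeof(*r));
}

/*
 * Record an event without ever waiting: when the ring is full, the
 * oldest unread slot is simply overwritten. That is the lossy part.
 */
static void
perf_ring_record(PerfEventRing *r, int pid, int wait_ms, const char *kind)
{
    PerfEvent  *e;

    assert(kind != NULL);
    e = &r->slots[r->next_seq % PERF_EVENT_SLOTS];
    e->seq = r->next_seq;
    e->pid = pid;
    e->wait_ms = wait_ms;
    strncpy(e->kind, kind, sizeof(e->kind) - 1);
    e->kind[sizeof(e->kind) - 1] = '\0';
    r->next_seq++;
}

/*
 * Persistence-writer side: copy up to max unread events into out, oldest
 * first, and return how many were copied. Events that were overwritten
 * before the writer got to them are counted in r->dropped.
 */
static size_t
perf_ring_drain(PerfEventRing *r, PerfEvent *out, size_t max)
{
    size_t      n = 0;

    if (r->next_seq - r->next_read > PERF_EVENT_SLOTS)
    {
        uint64_t    oldest = r->next_seq - PERF_EVENT_SLOTS;

        r->dropped += oldest - r->next_read;
        r->next_read = oldest;
    }
    while (r->next_read < r->next_seq && n < max)
    {
        out[n++] = r->slots[r->next_read % PERF_EVENT_SLOTS];
        r->next_read++;
    }
    return n;
}
```

With the four-slot ring above, recording six events and then draining returns the four newest events and leaves a dropped count of two. The fixed slot size is also where this shape gets uncomfortable: variable-length detail such as query text does not fit a fixed record, which is the memory management problem described above.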

I think the work you've now done on dynamic shared memory gives the right shape of infrastructure that I could pull this off now. I even have funding to work on it again, and it's actually the #2 thing I'd like to take on as I get energy for new feature development. (#1 is the simple but time consuming job of adding block write counters, the lack of which is just killing me on some fast growing installs.)

I have a lot of unread messages on this list to sort through right now. I know I saw someone try to revive the idea of saving new sorts of performance log data again recently; can't seem to find it again right now. That didn't seem like it went any farther than thinking about the specifications though. The last time I jumped right over that and hit a wall with this one hard part of the implementation instead, low overhead memory management for saving everything.


--
Greg Smith greg.sm...@crunchydatasolutions.com
Chief PostgreSQL Evangelist - http://crunchydatasolutions.com/


