On 08/12/17 14:59, Alexander Korotkov wrote:
> These results look promising to me. Could you try benchmarking with
> more workloads, including read-only and mixed mostly-read workloads?
> You can try the same benchmarks I used in my talk about CSN at
> pgconf.eu [1], slides 19-25 (and you're welcome to invent more
> benchmarks yourself).
Sure, here are some more benchmarks.
I had already measured the "skip some updates" and "select-only"
pgbench variants, as well as a simple "select 1" query. These runs are
for scale 1500 and for 20 to 1000 connections. The graphs are attached.
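For reference, these variants can be reproduced with stock pgbench
roughly as follows (a sketch: the thread counts and durations here are
my assumptions; "skip some updates" is the builtin simple-update
script, i.e. -N):

    pgbench -i -s 1500 bench

    # "skip some updates" and "select-only" builtins, 20..1000 clients
    pgbench -N -c 1000 -j 32 -T 300 bench
    pgbench -S -c 1000 -j 32 -T 300 bench

    # bare "select 1", to isolate snapshot-taking overhead
    echo 'select 1;' > select1.sql
    pgbench -n -f select1.sql -c 1000 -j 32 -T 300 bench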
"select 1", which basically benchmarks snapshot taking, shows an
impressive twofold increase in TPS over master, but this is to be
expected. "select-only" stabilizes at 20% higher than master.
It is interesting that these select-only scenarios show almost no
degradation as the client count grows.
For the "skip some updates" scenario, CSN is slightly slower than
master, but this is improved by the LWLock patch I mentioned upthread.
I also replicated the setup from your slides 23 and 25. I used scale 500
and client counts 20-300, and probably the same 72-core Xeon.
Slide 23 shows 22% write and 78% read queries, i.e. "-b select-only@9
-b tpcb-like@1". The corresponding picture is called "random.png". The
absolute numbers are somewhat lower for my run, but CSN is about 40%
faster than master, similar to the CSN-rewrite variant.
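For completeness, the command line for that mix (the script weights are
from the slide; the remaining flags are my guesses, with the client
count varied from 20 to 300):

    pgbench -b select-only@9 -b tpcb-like@1 -c 100 -j 32 -T 300 bench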
Slide 25 is a custom script called "rrw" with 20 extra read queries. We
can see that master has improved considerably since your run, and that
the current CSN patch shows the same general behaviour as CSN-rewrite,
although it is slower in absolute numbers.
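I don't have the original "rrw" script at hand, so here is only a
hypothetical sketch of its shape as rrw.sql, to be run with
"pgbench -n -f rrw.sql ..." (one write plus point reads; repeat the
final SELECT until there are 20 reads per transaction; the real script
from the slides may differ):

    \set aid random(1, 100000 * :scale)
    BEGIN;
    UPDATE pgbench_accounts SET abalance = abalance + 1 WHERE aid = :aid;
    SELECT abalance FROM pgbench_accounts WHERE aid = :aid;
    SELECT abalance FROM pgbench_accounts WHERE aid = :aid;
    END;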
> Also, I wonder how the current version of the CSN patch behaves in
> the worst case, when we have to scan a table with a lot of unique
> xids (and correspondingly have to do a lot of csnlog lookups)? See
> [1], slide 18. This worst case was a significant part of my
> motivation to try the "rewrite xid with csn" approach. Please find
> the simple extension I used to fill a table with random xmins in the
> attachment.
OK, let's summarize how the worst case in question works. It happens
when the tuples have a wide range of xmins and no hint bits, so every
visibility check has to go to clog. Fortunately, on master we also set
hint bits during these checks, and subsequent scans can determine
visibility using just the hint bits and the current snapshot, which
makes them much faster. With CSN, hint bits are not enough: the
visibility checks always go to the CSN log for all transactions newer
than the global xmin. This can become very slow if there is a
long-running transaction that holds back the global xmin.
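The hint-bit part is easy to observe on master with the stock
pageinspect extension: HEAP_XMIN_COMMITTED is the 0x0100 bit of
t_infomask, and the number of tuples that have it set on a given page
jumps after the first scan. For example:

    psql bench -c "CREATE EXTENSION IF NOT EXISTS pageinspect"
    # count tuples on page 0 with the HEAP_XMIN_COMMITTED hint (0x0100)
    psql bench -c "SELECT count(*) FILTER (WHERE t_infomask & 256 <> 0)
                   FROM heap_page_items(get_raw_page('pgbench_accounts', 0));"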
I made a simple test to see these effects. The procedure is as follows.
I start a transaction in psql; that will be our long-running
transaction. Next, I run pgbench for a minute and randomize the tuple
xmins in "pgbench_accounts" using your extension. Then I run "select
sum(abalance) from pgbench_accounts" twice and record the durations.
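In script form the procedure looks roughly like this (a sketch;
fill_random_xmin() is a hypothetical name standing in for whatever the
attached extension actually exposes):

    -- session 1 (psql), kept open for the whole test:
    BEGIN;
    SELECT txid_current();  -- assign an xid to hold back the global xmin

    # session 2 (shell):
    pgbench -c 16 -j 16 -T 60 bench
    psql bench -c "SELECT fill_random_xmin('pgbench_accounts')"      # hypothetical
    time psql bench -c "SELECT sum(abalance) FROM pgbench_accounts"  # scan 1
    time psql bench -c "SELECT sum(abalance) FROM pgbench_accounts"  # scan 2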
Here is a table with the results (50M tuples; 1M transactions for
master and 400k for CSN):

    Branch    scan 1, s    scan 2, s
    --------------------------------
    CSN          80          80
    master       13           3.5
So, we are indeed seeing the expected results: a significant slowdown
under a long-running transaction is an important problem for this
patch. Even the first scan is much slower, because a CSN log page holds
16 times fewer transactions than a clog page, while we have the same
number of buffers (at most 128) for them. When the range of xids is
wide, we spend most of the time loading and evicting pages. I believe
this can be improved by using more buffers for the CSN log. I did some
work in this direction, namely, I made SLRUs use a dynahash table
instead of a linear search for page->buffer lookups. This is included
in the v8 I posted earlier, but it should probably be split out as a
separate patch.
--
Alexander Kuzmenkov
Postgres Professional: http://www.postgrespro.com
The Russian Postgres Company