Re: [PERFORM] Distributed transactions and asynchronous commit

Giuseppe Broccolo Wed, 17 Jul 2013 04:15:39 -0700

On 17/07/2013 12:52, Xenofon Papadopoulos wrote:

Thank you for your replies so far.
The DB in question is Postgres+ 9.2 running inside a VM with the following specs:
16 CPUs (dedicated to the VM)
60G RAM
RAID-10 storage on a SAN for pgdata and pgarchives, using different LUNs for each.
We have 3 kinds of queries:
- The vast majority of the queries are small SELECT/INSERT/UPDATEs which are part of distributed transactions
- A few small ones, which are mostly SELECTs
- A few bulk loads, where we add 100k - 1M rows to tables

Our settings are:

shared_buffers: 8G
work_mem: 12M
checkpoint_segments: 64

shared_buffers could be set to 20-30% of the available RAM; in your case, 16GB could be a reasonable value.
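
As a rough check of that rule of thumb against the 60G VM described above, here is a minimal sketch. The function name and the 20%/30% bounds as defaults are just illustrative choices taken from the sentence above, not a PostgreSQL API.

```python
def shared_buffers_range_gb(ram_gb, low=0.20, high=0.30):
    """Return the (low, high) shared_buffers range in GB for a 20-30% rule of thumb."""
    return ram_gb * low, ram_gb * high


if __name__ == "__main__":
    ram_gb = 60  # RAM of the VM, as stated in the original message
    low_gb, high_gb = shared_buffers_range_gb(ram_gb)
    print(f"suggested shared_buffers range: {low_gb:.0f}G to {high_gb:.0f}G")
    # The current setting (8G) and the suggested one (16G) can be compared to the range.
    for setting_gb in (8, 16):
        inside = low_gb <= setting_gb <= high_gb
        print(f"{setting_gb}G within range: {inside}")
```

With these assumptions the current 8G falls below the range, while 16G sits inside it.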

Autovacuum is somewhat aggressive, as our data changes quite often and without it the planner was completely off.

Right now we use:

 autovacuum_analyze_scale_factor: 0.1
 autovacuum_analyze_threshold: 50
 autovacuum_freeze_max_age: 200000000
 autovacuum_max_workers: 12
 autovacuum_naptime: 10s
 autovacuum_vacuum_cost_delay: 20ms
 autovacuum_vacuum_cost_limit: -1
 autovacuum_vacuum_scale_factor: 0.2
 autovacuum_vacuum_threshold: 50

This means that autovacuum will be triggered after around 50 updates each time on small tables (the trigger point is autovacuum_vacuum_threshold plus autovacuum_vacuum_scale_factor times the number of rows in the table). If your database is doing a lot of updates/inserts (as I understood), an unnecessary number of vacuum runs can be reached, which will generate a lot of I/O. If the inserts/updates are small, this value could be decreased.
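
To see how these settings translate into trigger points, the PostgreSQL documentation gives the vacuum threshold as the base threshold plus the scale factor times the number of tuples, and the analyze threshold in the same way. A minimal sketch with the values listed above follows; the function name and the example table sizes are only illustrative.

```python
def autovacuum_trigger(reltuples, threshold, scale_factor):
    """Number of changed rows after which autovacuum acts on a table.

    Formula from the PostgreSQL docs:
        trigger = threshold + scale_factor * number of tuples
    """
    return threshold + scale_factor * reltuples


if __name__ == "__main__":
    # Settings quoted in the message above.
    vacuum = {"threshold": 50, "scale_factor": 0.2}
    analyze = {"threshold": 50, "scale_factor": 0.1}

    # Hypothetical table sizes, chosen only to show the scaling.
    for rows in (0, 1_000, 1_000_000):
        v = autovacuum_trigger(rows, **vacuum)
        a = autovacuum_trigger(rows, **analyze)
        print(f"{rows:>9} rows: vacuum after {v:.0f} dead rows, "
              f"analyze after {a:.0f} changed rows")
```

So the 50-row figure only dominates for very small tables; for large tables the scale factor term decides how often autovacuum runs.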


Giuseppe.

At high-peak hour, the disk utilization for the pgdata mountpoint is:
00:00:01  DEV           tps  rd_sec/s  wr_sec/s  avgrq-sz  avgqu-sz   await  svctm   %util
13:20:01  dev253-2  7711.62  24166.97  56657.95     10.48    735.28   95.09   0.11   86.11
13:30:01  dev253-2  5340.88  19465.30  39133.32     10.97    319.20   59.94   0.15   82.30
13:40:01  dev253-2  2791.02  13061.76  19330.40     11.61    349.95  125.38   0.33   90.73
13:50:01  dev253-2  3478.69  10503.84  25505.27     10.35    308.12   88.57   0.20   68.12
14:00:01  dev253-2  5269.12  33613.43  35830.13     13.18    232.48   44.09   0.19  100.05
14:10:01  dev253-2  4910.24  21767.22  33970.96     11.35    322.52   65.64   0.21  104.55
14:20:02  dev253-2  5358.95  40772.03  33682.46     13.89    721.81  134.32   0.20  104.92
14:30:01  dev253-2  4420.51  17256.16  33315.27     11.44    336.53   76.13   0.15   65.25
14:40:02  dev253-2  4884.13  28439.26  31604.76     12.29    265.32   54.26   0.20   97.51
14:50:01  dev253-2  3124.91   8077.46  22511.59      9.79     50.41   16.13   0.24   76.17
and for pgarchives:
00:00:01  DEV           tps  rd_sec/s  wr_sec/s  avgrq-sz  avgqu-sz   await  svctm   %util
13:20:01  dev253-3  2802.25      0.69  22417.32      8.00    465.05  165.94   0.02    4.32
13:30:01  dev253-3  1559.87  11159.45  12120.99     14.92     64.17   41.11   0.08   12.02
13:40:01  dev253-3   922.62   8066.62   7129.15     16.47     19.75   21.40   0.08    6.99
13:50:01  dev253-3  1194.81    895.34   9524.53      8.72     28.40   23.76   0.01    1.69
14:00:01  dev253-3  1919.12      0.46  15352.49      8.00     51.75   26.95   0.01    1.61
14:10:01  dev253-3  1770.59   9286.61  13873.79     13.08    139.86   78.97   0.08   14.46
14:20:02  dev253-3  1595.04  11810.63  12389.08     15.17    109.17   68.42   0.15   24.71
14:30:01  dev253-3  1793.71  12173.88  13957.79     14.57    141.56   78.89   0.08   13.61
14:40:02  dev253-3  1751.62      0.43  14012.53      8.00     43.38   24.76   0.01    1.40
14:50:01  dev253-3  1351.72   3225.19  10707.29     10.31     31.91   23.59   0.02    2.93
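
To read the sar figures in more familiar units: sar reports rd_sec/s and wr_sec/s in 512-byte sectors, so the rates convert to MiB/s by multiplying by 512 and dividing by 1024*1024. A minimal sketch using two of the pgdata samples above; the helper name is only illustrative.

```python
SECTOR_BYTES = 512  # sar counts rd_sec/s and wr_sec/s in 512-byte sectors


def sectors_to_mib(sectors_per_sec):
    """Convert a sar sectors-per-second rate to MiB per second."""
    return sectors_per_sec * SECTOR_BYTES / (1024 * 1024)


if __name__ == "__main__":
    # (time, rd_sec/s, wr_sec/s) copied from the pgdata (dev253-2) samples.
    samples = [
        ("13:20:01", 24166.97, 56657.95),
        ("14:20:02", 40772.03, 33682.46),
    ]
    for when, rd, wr in samples:
        print(f"{when}: read {sectors_to_mib(rd):.1f} MiB/s, "
              f"write {sectors_to_mib(wr):.1f} MiB/s")
```

Seen this way, the throughput in MiB/s is modest while tps and await are high, which points at many small requests rather than raw bandwidth.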
On Wed, Jul 17, 2013 at 1:09 PM, Giuseppe Broccolo <[email protected]> wrote:
    Hi,

    On 17/07/2013 09:18, Xenofon Papadopoulos wrote:
    In the asynchronous commit documentation, it says:

    /The commands supporting two-phase commit, such as PREPARE
    TRANSACTION, are also always synchronous/

    Does this mean that all queries that are part of a distributed
    transaction are synchronous?

    In our databases we have extremely high disk I/O, I'm wondering
    if distributed transactions may be the reason behind it.
    Distributed transactions are based on two-phase commit (2PC)
    algorithms to ensure correct transaction completion, so they are
    synchronous.
    However, I think this is not the main reason behind your extremely
    high disk I/O. You should check whether your system is properly
    tuned to get the best performance.
    First of all, you could take a look at your PostgreSQL
    configuration and check whether shared_buffers is set properly,
    taking into account your available RAM. The conservative PostgreSQL
    default value is 24 MB, which forces the system to use a lot of
    disk I/O.
    Aside from this, you could check whether autovacuum is triggered
    often (generating a large amount of I/O) in case of heavy use
    of updates/inserts in your database.

    Regards,

    Giuseppe.
--
Giuseppe Broccolo - 2ndQuadrant Italy
PostgreSQL Training, Services and Support
[email protected] | www.2ndQuadrant.it
