[openstack-dev] [gate] concurrent workers are overwhelming postgresql in the gate - bug 1338841

Matt Riedemann Wed, 09 Jul 2014 13:01:51 -0700

Bug 1338841 [1] started showing up yesterday and I first noticed it onthe change to set osapi_volume_workers equal to the number of CPUsavailable by default. Similar patches for trove (api/conductor workers)and glance (api/registry workers) have landed in the last week also, andnova has been running with multiple api/conductor workers by defaultsince Icehouse.

It looks like the cinder change tipped the default postgresqlmax_connections over and we started getting asynchronous connectionfailures in that job. [2]

We can also note that the postgresql job is the only one that runs thenova api-metadata service, which has it's own workers.

The VMs the jobs are running on have 8 VCPUs, so that's at least 88workers between nova (3), cinder (1), glance (2), trove (2), neutron,heat and ceilometer.

So osapi_volume_workers (8) + n-api-meta workers (8) seems to havetipped it over.

The first attempt at a fix is to simply double the defaultmax_connections value [3].

While looking up the postgresql configuration docs, I also read a bit onsynchronous_commit=off and fsync=off, which sound like we might want toalso think about using one of those in devstack runs since they aresupposed to be more performant if you don't care about disaster recovery(which we don't in gate runs on VMs).

Anyway, bumping max connections might fix the gate, I'm just sendingthis out to see if there are any postgresql experts out there withadditional tips or insights on things we can tweak or look for,including whether or not it might be worthwhile to setsynchronous_commit=off or fsync=off for gate runs.


[1] https://bugs.launchpad.net/nova/+bug/1338841
[2] http://goo.gl/yRBDjQ
[3] https://review.openstack.org/#/c/105854/

--

Thanks,

Matt Riedemann


_______________________________________________
OpenStack-dev mailing list
OpenStack-dev@lists.openstack.org
http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-dev

[openstack-dev] [gate] concurrent workers are overwhelming postgresql in the gate - bug 1338841

Reply via email to