Re: [HACKERS] buildfarm failures on smew and anole

2013-10-17 Thread Andres Freund
On 2013-10-16 09:35:46 -0400, Robert Haas wrote: Gah. I fixed one instance of that problem in test_config_settings(), but missed the other. Maybe it'd be better to default to none, just as max_connections defaults to 1 and shared_buffers to 16? As we write out the value in the config file,

Re: [HACKERS] buildfarm failures on smew and anole

2013-10-16 Thread Robert Haas
On Tue, Oct 15, 2013 at 11:17 PM, Peter Eisentraut pete...@gmx.net wrote: On Mon, 2013-10-14 at 18:14 -0400, Robert Haas wrote: I cleaned the semaphores on smew, but they came back. Whatever is crashing is leaving the semaphores lying around. Ugh. When did you do that exactly? I thought

Re: [HACKERS] buildfarm failures on smew and anole

2013-10-16 Thread Andres Freund
On 2013-10-16 08:39:10 -0400, Robert Haas wrote: On Tue, Oct 15, 2013 at 11:17 PM, Peter Eisentraut pete...@gmx.net wrote: On Mon, 2013-10-14 at 18:14 -0400, Robert Haas wrote: I cleaned the semaphores on smew, but they came back. Whatever is crashing is leaving the semaphores lying

Re: [HACKERS] buildfarm failures on smew and anole

2013-10-16 Thread Robert Haas
On Wed, Oct 16, 2013 at 8:54 AM, Andres Freund and...@2ndquadrant.com wrote: On 2013-10-16 08:39:10 -0400, Robert Haas wrote: On Tue, Oct 15, 2013 at 11:17 PM, Peter Eisentraut pete...@gmx.net wrote: On Mon, 2013-10-14 at 18:14 -0400, Robert Haas wrote: I cleaned the semaphores on smew, but

Re: [HACKERS] buildfarm failures on smew and anole

2013-10-16 Thread Robert Haas
On Wed, Oct 16, 2013 at 9:37 AM, Andres Freund and...@2ndquadrant.com wrote: On 2013-10-16 09:35:46 -0400, Robert Haas wrote: Gah. I fixed one instance of that problem in test_config_settings(), but missed the other. Maybe it'd be better to default to none, just as max_connections defaults

Re: [HACKERS] buildfarm failures on smew and anole

2013-10-16 Thread Andres Freund
On 2013-10-16 09:44:32 -0400, Robert Haas wrote: On Wed, Oct 16, 2013 at 9:37 AM, Andres Freund and...@2ndquadrant.com wrote: On 2013-10-16 09:35:46 -0400, Robert Haas wrote: Gah. I fixed one instance of that problem in test_config_settings(), but missed the other. Maybe it'd be better

Re: [HACKERS] buildfarm failures on smew and anole

2013-10-15 Thread Peter Eisentraut
On Mon, 2013-10-14 at 18:14 -0400, Robert Haas wrote: I cleaned the semaphores on smew, but they came back. Whatever is crashing is leaving the semaphores lying around. Ugh. When did you do that exactly? I thought I fixed the problem that was causing that days ago, and the last 4 days

Re: [HACKERS] buildfarm failures on smew and anole

2013-10-14 Thread Robert Haas
On Fri, Oct 11, 2013 at 4:03 PM, Andrew Dunstan and...@dunslane.net wrote: Can the owners of these buildfarm machines please check whether there are extra semaphores allocated and if so free them? Or at least reboot, to see if that unbreaks the build? It is possible to set the buildfarm

Re: [HACKERS] buildfarm failures on smew and anole

2013-10-14 Thread Andres Freund
On 2013-10-14 09:12:09 -0400, Robert Haas wrote: On Fri, Oct 11, 2013 at 4:03 PM, Andrew Dunstan and...@dunslane.net wrote: Can the owners of these buildfarm machines please check whether there are extra semaphores allocated and if so free them? Or at least reboot, to see if that unbreaks

Re: [HACKERS] buildfarm failures on smew and anole

2013-10-14 Thread Andrew Dunstan
On 10/14/2013 09:12 AM, Robert Haas wrote: On Fri, Oct 11, 2013 at 4:03 PM, Andrew Dunstan and...@dunslane.net wrote: Can the owners of these buildfarm machines please check whether there are extra semaphores allocated and if so free them? Or at least reboot, to see if that unbreaks the

Re: [HACKERS] buildfarm failures on smew and anole

2013-10-14 Thread Robert Haas
On Mon, Oct 14, 2013 at 9:22 AM, Andrew Dunstan and...@dunslane.net wrote: Maybe I didn't explain that well. The problem is that the regression tests require at least 20 connections to run, and those two machines are currently auto-selecting 10 connections, so make check is failing. Why do

Re: [HACKERS] buildfarm failures on smew and anole

2013-10-14 Thread Andres Freund
On 2013-10-14 09:28:04 -0400, Robert Haas wrote: # By convention, we put no more than twenty tests in any one parallel group; # this limits the number of connections needed to run the tests. If it's not supposed to matter how many connections are available, then that comment is misleading.

Re: [HACKERS] buildfarm failures on smew and anole

2013-10-14 Thread Tom Lane
Robert Haas robertmh...@gmail.com writes: Anyway, as Andres said, the machines were working fine until recently, so I think we just need to get them un-broken. I think you're talking past each other. What would be useful here is to find out *why* these machines are now failing, when they

Re: [HACKERS] buildfarm failures on smew and anole

2013-10-14 Thread Robert Haas
On Mon, Oct 14, 2013 at 1:33 PM, Tom Lane t...@sss.pgh.pa.us wrote: Robert Haas robertmh...@gmail.com writes: Anyway, as Andres said, the machines were working fine until recently, so I think we just need to get them un-broken. I think you're talking past each other. What would be useful

Re: [HACKERS] buildfarm failures on smew and anole

2013-10-14 Thread Peter Eisentraut
On Fri, 2013-10-11 at 15:33 -0400, Robert Haas wrote: Can the owners of these buildfarm machines please check whether there are extra semaphores allocated and if so free them? Or at least reboot, to see if that unbreaks the build? I cleaned the semaphores on smew, but they came back.

Re: [HACKERS] buildfarm failures on smew and anole

2013-10-14 Thread Robert Haas
On Mon, Oct 14, 2013 at 4:29 PM, Peter Eisentraut pete...@gmx.net wrote: On Fri, 2013-10-11 at 15:33 -0400, Robert Haas wrote: Can the owners of these buildfarm machines please check whether there are extra semaphores allocated and if so free them? Or at least reboot, to see if that unbreaks

Re: [HACKERS] buildfarm failures on smew and anole

2013-10-11 Thread Andrew Dunstan
On 10/11/2013 03:33 PM, Robert Haas wrote: The build is continuing to fail on smew and anole. The reason it's failing is because those machines are choosing max_connections = 10, which is not enough to run the regression tests. I think this is probably because of System V semaphore
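
For readers following the thread, here is a minimal sketch of how an auto-selected max_connections can drop to 10 when semaphores are scarce. It is not the initdb source. The candidate list, the fallback to the smallest value, and the helper names pick_max_connections and make_semaphore_probe are assumptions made for illustration, and the probe is a stand-in that compares numbers instead of starting a server.

```python
from typing import Callable, Sequence

# Candidate settings, tried from largest to smallest.
# This list is an assumption for illustration, not copied from initdb.
CANDIDATE_CONNECTIONS: Sequence[int] = (100, 50, 40, 30, 20, 10)


def pick_max_connections(
    works: Callable[[int], bool],
    candidates: Sequence[int] = CANDIDATE_CONNECTIONS,
) -> int:
    """Return the first candidate for which works(n) is True.

    If no candidate works, fall back to the smallest one (assumed behaviour).
    """
    for n in candidates:
        if works(n):
            return n
    return candidates[-1]


def make_semaphore_probe(free_semaphores: int) -> Callable[[int], bool]:
    """Hypothetical probe: a setting 'works' if it fits in the free semaphores.

    A real probe would try to start a server with that setting; this one only
    models the idea that each connection needs one semaphore.
    """
    def probe(n: int) -> bool:
        return n <= free_semaphores
    return probe


# A machine with plenty of free semaphores gets the largest candidate.
healthy = pick_max_connections(make_semaphore_probe(free_semaphores=200))

# A machine where leaked semaphores leave only a few free ends up at 10,
# which is below the 20 connections the regression tests need.
leaky = pick_max_connections(make_semaphore_probe(free_semaphores=15))

print(healthy, leaky)
```

Under these assumptions, leftover semaphores from crashed processes are enough to push the selection down to 10 without any change in the selection logic itself, which matches the symptom described in the thread.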