Hi Kern,

2009/11/4 Kern Sibbald <k...@sibbald.com>:
> Yes, this is very useful.  It is not often that I am able to see a series of
> cascading errors generated by a real "database" error, so it gave me a chance
> to see how many times an error message is repeated, and where it gets
> distorted because the job thread must continue to the end but avoid trying to
> do anything that will cause another "false" error message.
I'm glad to hear it!

> I think I have cleaned up a good part of these error messages, but what is
> worrying me is that you say that Jobs still got stuck in the SD.  So, what
> would be most useful would be for you to tell me the *exact*  PostgreSQL
> config statement (including where it is) that I must change to invoke this
> error for documentation purposes.  I am going to add debug code to Bacula
> force the error by allowing a maximum of 2 and trying to start 10 jobs.  If I
> can duplicate the jobs getting "stuck", I can probably completely resolve it.
Steps to reproduce, then:

 1) Set up bacula with a large number of clients that run jobs concurrently.
 2) Edit postgresql.conf, set "max_connections = <small number>".
Restart PostgreSQL completely.
 3) Restart director (so it reconnects to PostgreSQL -- it seemed to
have problems with "status dir" on the console if this wasn't done).
 4) Run all client jobs at once, so bacula-dir connects to PostgreSQL
and exceeds max_connections.
 5) ???
 6) Profit!

> I'll be submitting some more patches to clean the error handling up a lot
> more, but I wouldn't recommend at this point that you attempt to take them.
Awesome, thankyou. I'll probably keep an eye on the git commit log but
I won't have much time to work on this from my end for a week or so
after this Friday due to other work commitments.

> I recommend sticking with what you have and either going back to 3.0.3 or
> preferrably testing the patch carefully before putting it into production.
I've given it a bit of a beating, but I am pretty satisfied that when
max_connections is set high enough I won't run into these problems
anyway, so I'm going to stick with the patched 3.0.3 packages I have
installed.

> If I find out how to "unstick" the stuck jobs, I will let you know.
Again, thanks for spending the time looking into this, it's really helpful!

--Alex

------------------------------------------------------------------------------
Let Crystal Reports handle the reporting - Free Crystal Reports 2008 30-Day 
trial. Simplify your report design, integration and deployment - and focus on 
what you do best, core application coding. Discover what's new with
Crystal Reports now.  http://p.sf.net/sfu/bobj-july
_______________________________________________
Bacula-devel mailing list
Bacula-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-devel

Reply via email to