I am running on 2.6.9-11. All the PID's are different.

Arunav.

----- Original Message ----- 
From: "Kern Sibbald" <[EMAIL PROTECTED]>
To: "Arunav Mandal" <[EMAIL PROTECTED]>
Cc: <bacula-users@lists.sourceforge.net>
Sent: Friday, September 23, 2005 8:59 AM
Subject: Re: [Bacula-users] Bacula dir crashing


> On Wednesday 21 September 2005 12:08, Arunav Mandal wrote:
> > Hi,
> >         You said to attach gdb to bacula-dir but there are 4 bacula-dir
> > running which one to attach to? In the documentation it says if I have
> > /lib/tls it may have some problems. I am running Centos 4.1. Anyway the
DIR
> > server has has only one cpu but was running smp kernel now I switched to
> > normail kernel.
>
> Uh, if there are 4 bacula-dir's running, then you must be running on a 2.4
> kernel, and in that case using /lib/tls will create no end of problems
with
> Bacula mostly bizarre hangs (missed signals).  Under a 2.6 kernel, Bacula
> should appear as a single process.
>
> In the case you are using a 2.4 kernel, you either need to move /lib/tls,
or
> disable it with the environment variable as documented in the manual.
When
> using the debugger on a 2.4 kernel, you always attach to the first PID,
but
> if you follow the instructions in the manual, you will run Bacula under
the
> debugger rather than attaching to it later.
>
> In the case of a 2.6 kernel, if there are multiple instances of Bacula,
you
> are most likely using an option that shows all the threads -- as opposed
to
> running under a 2.6 kernel, they should all have the same PID, or you have
> disabled /lib/tls.  This /lib/tls workaround is not needed for 2.6
kernels.
>
> >
> > Arunav.
> >
> > > Hello,
> > >
> > > I suspect that you have now set the record for having the most
problems
> >
> > with
> >
> > > Bacula, if not, you are close.  Unfortunately, that is a rather
> > > unpleasant distinction :-(
> > >
> > > If you are asking if your FileSet is correct, I don't see any major
> >
> > problems.
> >
> > > However, now that you have moved the wildcards to the Exclude
resource, I
> > > personally would remove the "Exclude = yes" from the Options.  It
should
> >
> > do
> >
> > > no harm, but it could make reading the FileSet confusing for someone
who
> > > doesn't know the history ...
> > >
> > > There is one known mutex race condition in 1.36.x that could cause a
> >
> > Director
> >
> > > hang, and perhaps you are more likely to see it than most users
because
> >
> > you
> >
> > > are running a *lot* of jobs every night, and if I am not mistaken, you
> >
> > have a
> >
> > > real smp system, which tends to make race conditions even more
evident.
> > >
> > > Note, I forgot to mention last time I emailed that if either the SD
*or*
> >
> > the
> >
> > > DIR crashes during a backup, the number of files on your tape is
likely
> > > to
> >
> > be
> >
> > > wrong (if the Director goes down the SD cannot update the catalog).
> > >
> > > If this happens again, please attach to the Director with the debugger
> >
> > using
> >
> > > something like:
> > >
> > >   gdb bacula-dir <pid>
> > >
> > > where you replace <pid>  with the PID of the Director, then produce a
> > > traceback as described in the Kaboom chapter of the manual.  At least
I
> >
> > can
> >
> > > verify if you are seeing a known problem.  This race bug is fixed in
> > > 1.37, but it was such a substantial fix that there is no patch for
1.36.
> > >
> > > On Friday 16 September 2005 10:50, Arunav Mandal wrote:
> > > > > On Wednesday 14 September 2005 08:45, Arunav Mandal wrote:
> > > > > > Now I got another problem bacula dir crashed without any reason.
> >
> > What
> >
> > > > debug
> > > >
> > > > > > level I should use to see what's going on?
> > > > >
> > > > > You should have gotten a traceback by email.  If not, you can
produce
> >
> > one
> >
> > > > by
> > > >
> > > > > running the Director under the debugger as described in the kaboom
> > > > > chapter
> > > >
> > > > of
> > > >
> > > > > the manual.
> > > > >
> > > > > > Arunav.
> > > >
> > > > It happened again yesterday night bacula dir didnt crash it seems
but
> >
> > when
> >
> > > > I tried to log into it in morning via bconsole I can't and there
were
> > > > no backup mails also.I changed nothing in the config file expect the
> >
> > Fileset
> >
> > > > given below. Fileset was correct isnt?
> > > >
> > > > FileSet {
> > > >   Name = linux-default
> > > >   Ignore Fileset changes = yes
> > > >   Include {
> > > >   Options {
> > > >   signature=SHA1
> > > >   verify=pins1
> > > >   onefs=no
> > > >   sparse=no
> > > >   Exclude = yes
> > > > }
> > > >   File = /
> > > >   }
> > > >
> > > >   Exclude {
> > > >         File = /sys
> > > >         File = /proc
> > > >         File = /tmp
> > > >         File = /.journal
> > > >         File = /.fsck
> > > >         File = /mnt
> > > >         File = /dev
> > > >         File = /var/chroot/hoary-ia32/home
> > > >         File = /space
> > > >         File = *.mp3
> > > >         File = *.m4a
> > > >         File = *.o
> > > >         File = *.obj
> > > >         File = *.vob
> > > >         File = *.VOB
> > > >         File = *.journal
> > > >         File = *.fsck
> > > >
> > > > }
> > > > }
> > > >
> > > >
> > > >
> > > > Arunav.
> > >
> > > --
> > > Best regards,
> > >
> > > Kern
> > >
> > >   (">
> > >   /\
> > >   V_V
>
> -- 
> Best regards,
>
> Kern
>
>   (">
>   /\
>   V_V
>



-------------------------------------------------------
SF.Net email is sponsored by:
Tame your development challenges with Apache's Geronimo App Server. 
Download it for free - -and be entered to win a 42" plasma tv or your very
own Sony(tm)PSP.  Click here to play: http://sourceforge.net/geronimo.php
_______________________________________________
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users

Reply via email to