Our backup server has spontaneously stopped doing backups of itself, after
working fine for several months. it backs up (most) other clients fine; but
the daily mail message says:

  localhost  /var/tmp lev 0 FAILED [Request to localhost timed out.]
  localhost  /var/lib/rpm lev 0 FAILED [Request to localhost timed out.]
  localhost  /etc lev 0 FAILED [Request to localhost timed out.]

the disklist entry for localhost is:
localhost /etc localhost-nocomp-5am -1 local
localhost /var/lib/rpm tar-comp-high -1 local #(whoops, forgot to change the
dumptype last night).
localhost /var/tmp localhost-nocomp-5am -1 local

and the amanda.conf entries for  those dumptypes are:
define dumptype global {
    comment "Global definitions"
    # This is quite useful for setting global parameters, so you don't have
    # to type them everywhere.  All dumptype definitions in this sample file
    # do include these definitions, either directly or indirectly.
    # There's nothing special about the name `global'; if you create any
    # dumptype that does not contain the word `global' or the name of any
    # other dumptype that contains it, these definitions won't apply.
    # Note that these definitions may be overridden in other
    # dumptypes, if the redefinitions appear *after* the `global'
    # dumptype name.
    # You may want to use this for globally enabling or disabling
    # indexing, recording, etc.  Some examples:
    index yes
    # record no
}

define dumptype tar-comp-high {
    comment "very important partitions on fast machines"
    global
    program "GNUTAR"
    compress client best
    priority high
}

define dumptype localhost-nocomp-5am {
        comment "localhost, so don't compress. don't use the holdingdisk, so
it won't beat up our one hdd"
        global
        program "GNUTAR"
        compress none
        holdingdisk no
        starttime 500  # for some reason this doesn't work
        priority low
}

I've been messing around with options lately; so if some things don't make
sense, that's probably why. please tell me what could be better, tho. :)

this thing *had* been working fine for some months; the only thing that
changed, was that one client machine was reinstalled/upgraded, and had
screwed-up NFS mounts for a while. that's been fixed now, but it seems like
it was interfering with amanda backups for a while, on those clients that
had non-functioning/stale NFS mounts from that box.

there's plenty of disk space.

the debug files in /tmp/amanda seem fine and normal; the amandad and runtar
files exist and don't have any errors. the 'sendbackup' debug file doesn't
exist, tho. :(
(and yes, the sendbackup binary exists and has the same permissions as a
working system does).

looks like there's only one 'amandad' connect notice in /var/log/secure,
rather than one for each disk to be backed up, like on other systems. I
don't see any errors in syslog or other logs, tho.

where do I continue debugging?

Carl Soderstrom.
-- 
Network Engineer
Real-Time Enterprises
(952) 943-8700

Reply via email to