As stated in my email, my client and server are the same machine.
Here are the permissions on .amandahosts:
-rw------- 1 backup disk 174 May 22 13:14
/root/amanda-data/.amandahosts
Adding root entries, like this to .amandahosts:
10.1.0.2 backup
10.1.0.3 backup
server02.ct backup
server01.ct backup
broadcast backup
10.1.0.2 root
10.1.0.3 root
server02.ct root
server01.ct root
broadcast root
did not change any behavior. The debug/log output is identical.
amcheck still returns:
WARNING: server02.ct: selfcheck request timed out. Host down?
Client check: 1 host checked in 30.036 seconds, 1 problem found
Thank you,
Cory Visi
"Martinez, Michael -
CSREES/ISTM" To: 'Cory Visi'
<[EMAIL PROTECTED]>,
<[EMAIL PROTECTED] [EMAIL PROTECTED]
eusda.gov> cc:
Subject: RE: Me again, still
getting
05/22/2002 12:29 PM WARNING: host: selfcheck request
time d out.
Host down?
You're .amandahosts on your tape server needs to include a line for "root"
for all your tape clients (including itself) ; and .amandahosts on your
clients needs to be chmod 400
-----Original Message-----
From: Cory Visi [mailto:[EMAIL PROTECTED]]
Sent: Wednesday, May 22, 2002 11:36 AM
To: [EMAIL PROTECTED]
Subject: Me again, still getting WARNING: host: selfcheck request timed
out. Host down?
Ok, I have looked into my issues further. Many replies on the list have
helped me, although I am still having problems and I do not understand what
Amanda is doing.
Firstly, this error:
amandad: dgram_send_addr: sendto(0.0.0.0.929) failed: Invalid argument
Is clearly not indicative of an error. Amanda uses broadcasts to find
clients/servers regardless of the configuration. Some people get this error
but have a perfectly working system.
This error, however:
Amanda Backup Client Hosts Check
--------------------------------
WARNING: server02.ct: selfcheck request timed out. Host down?
Client check: 1 host checked in 30.028 seconds, 1 problem found
IS indicative of a problem.
So, starting from scratch I will explain all the details of my setup:
I am using Amanda 2.4.2p2 on Linux 2.4.13 with glibc 2.2.2 compiled by
myself.
I am running amandad, amindexd, amidxtaped from xinetd. All these protocols
are listed in my /etc/services.
Currently, I am only trying to backup the data on the same machine as the
server, so the server and client are the _same machine_. I am using the
eth0 interface IPs to communicate, although, clearly the packets will go
through the loopback. The local firewall is configured to _allow all
packets_ through on the loopback and the eth0 interface (since it's local
only).
The following are relevant files concerning my setup. Some of my setup may
seem redundant and superfluous, however, in my efforts to make things work,
I have tried a number of things.
/etc/passwd entry for amanda user:
backup:x:417:6:Amanda Backup:/root/amanda-data:/bin/bash
/root/amanda-data/.amandahosts:
10.1.0.3 backup
server02.ct backup
broadcast backup
/etc/hosts:
127.0.0.1 localhost.localdomain localhost
10.1.0.3 server02.ct.internaldomain.com server02.ct
0.0.0.0 broadcast broadcast
some of amanda.conf:
dumpuser "backup"
inparallel 4
netusage 51200 Kbps
dumpcycle 3 weeks
runspercycle 15
tapecycle 20 tapes
runtapes 1
tpchanger "chg-manual"
tapedev "/dev/nst0"
changerfile "/root/amanda-data/changer-status"
changerdev "/dev/null"
tapetype DAT
infofile "/root/amanda-data/daily"
logdir "/var/log/amanda/daily"
indexdir "/root/amanda-data/daily"
tapelist "/root/amanda-data/daily/tapelist"
define tapetype DAT {
comment "Archive Python 04687-XXX"
length 4000 mbytes
filemark 100 kbytes
speed 800 kbytes
}
disklist:
server02.ct sda1 comp-root
server02.ct sda2 comp-user
server02.ct sdb1 comp-user
xinetd entries:
service amanda
{
disable = no
bind = 10.1.0.3
log_on_success += USERID
log_on_failure += USERID
socket_type = dgram
protocol = udp
wait = yes
user = root
server = /usr/local/libexec/amandad
}
service amandaidx
{
disable = no
bind = 10.1.0.3
log_on_success += USERID
log_on_failure += USERID
socket_type = stream
protocol = tcp
wait = no
user = backup
server = /usr/local/libexec/amindexd
}
service amidxtape
{
disable = no
bind = 10.1.0.3
log_on_success += USERID
log_on_failure += USERID
socket_type = stream
protocol = tcp
wait = no
user = backup
server = /usr/local/libexec/amidxtaped
}
Following are my debug files for amcheck.
amcheck.debug:
amcheck: debug 1 pid 24151 ruid 417 euid 0 start time Wed May 22 10:29:30
2002
amcheck: dgram_bind: socket bound to 0.0.0.0.929
amcheck: pid 24151 finish time Wed May 22 10:30:00 2002
selfcheck.debug:
selfcheck: checking disk sdb1
selfcheck: device /dev/sdb1
selfcheck: OK
selfcheck: checking disk sda2
selfcheck: device /dev/sda2
selfcheck: OK
selfcheck: checking disk sda1
selfcheck: device /dev/sda1
selfcheck: OK
selfcheck: pid 8251 finish time Wed May 22 10:29:30 2002
amandad.debug:
got packet:
--------
Amanda 2.4 REQ HANDLE 000-48C30708 SEQ 1022077770
SECURITY USER backup
SERVICE selfcheck
OPTIONS ;
DUMP sdb1 0 OPTIONS |;bsd-auth;compress-fast;
DUMP sda2 0 OPTIONS |;bsd-auth;compress-fast;
DUMP sda1 0 OPTIONS |;bsd-auth;compress-fast;
--------
sending ack:
----
Amanda 2.4 ACK HANDLE 000-48C30708 SEQ 1022077770
----
amandad: dgram_send_addr: sendto(0.0.0.0.929) failed: Invalid argument
bsd security: remote host broadcast user backup local user backup
amandahosts security check passed
amandad: running service "/usr/local/libexec/selfcheck"
amandad: sending REP packet:
----
Amanda 2.4 REP HANDLE 000-48C30708 SEQ 1022077770
OPTIONS ;
OK /dev/sdb1
OK /dev/sda2
OK /dev/sda1
OK /sbin/dump executable
OK /bin/gzip executable
OK /etc/dumpdates read/writable
OK /dev/null read/writable
OK /tmp/amanda has more than 64 KB available.
OK /tmp/amanda has more than 64 KB available.
OK /etc has more than 64 KB available.
----
amandad: dgram_send_addr: sendto(0.0.0.0.929) failed: Invalid argument
You can see that amanda is communicating ONLY over the broadcast. Thanks,
and only thanks, to my manipulation of /etc/hosts and .amandahosts, this is
producing any output at ALL. If I remove the broadcast hosts and
.amandahosts entry, selfcheck doesn't even RUN.
In any case, this is how I interpret the above output (please correct me if
I'm wrong). The client is contacting the server over the broadcast address
0.0.0.0 as we have discovered amanda always does, however, that's it. It's
not contacting the server over a unicast address at all! amandad receives
the request, validates the user, and processes the request. Upon attempting
to send back the output, it fails, because you can not have one-to-one
communication on a broadcast address (i.e. Invalid argument).
Why does not the client try to use the unicast address I have specified in
the disklist? Even though the entries clearly resolve, I have tried using
the IP also with the same results. Why is amcheck only binding to 0.0.0.0
and not 10.1.0.3? What am I doing wrong?
Thank you,
Cory Visi