Hello,
I've run into a situation with my multi-year amanda setup similar to
problem described here:
http://archives.zmanda.com/amanda-archives/viewtopic.php?t=7223&postdays=0&postorder=asc&highlight=error+sending+req+write+error&start=0
A working setup for years, suddenly one specific client starts to fail
amcheck. However, amcheck on single client works fine. Same for amdump;
all clients fails, run against single problem client works fine.
Later, noted that if one, working host is down, then problem host works
fine, amcheck and amdump.
My best guess on trigger for this system is updates applied. This is an
Ubuntu 14.04 system, the amanda packages weren't updated but many
libraries were.
Any pointers on what to chase here?
Versions:
Server: 3.3.1-4, Debian 7.9, 32-bit (possible that matters?)
Problem clients: 3.3.3-2, Ubuntu 14.04
debug from amcheck -c daily on problem host:
Mon Feb 22 08:45:04 2016: thd-0xed6600: amandad: pid 21287 ruid 34 euid
34 version 3.3.3: start at Mon Feb 22 08:45:04 2016
Mon Feb 22 08:45:04 2016: thd-0xed6600: amandad:
security_getdriver(name=bsdtcp) returns 0x7f5a725c99c0
Mon Feb 22 08:45:04 2016: thd-0xed6600: amandad: version 3.3.3
Mon Feb 22 08:45:04 2016: thd-0xed6600: amandad: build:
VERSION="Amanda-3.3.3"
Mon Feb 22 08:45:04 2016: thd-0xed6600: amandad: BUILT_DATE="Tue Jan 7
21:16:20 UTC 2014" BUILT_MACH=""
Mon Feb 22 08:45:04 2016: thd-0xed6600: amandad: BUILT_REV="5099"
BUILT_BRANCH="community_3_3_3" CC="gcc"
Mon Feb 22 08:45:04 2016: thd-0xed6600: amandad: paths:
bindir="/usr/sbin" sbindir="/usr/sbin"
Mon Feb 22 08:45:04 2016: thd-0xed6600: amandad:
libexecdir="/usr/lib/amanda"
Mon Feb 22 08:45:04 2016: thd-0xed6600: amandad:
amlibexecdir="/usr/lib/amanda" mandir="/usr/share/man"
Mon Feb 22 08:45:04 2016: thd-0xed6600: amandad: AMANDA_TMPDIR="/tmp/amanda"
Mon Feb 22 08:45:04 2016: thd-0xed6600: amandad:
AMANDA_DBGDIR="/var/log/amanda" CONFIG_DIR="/etc/amanda"
Mon Feb 22 08:45:04 2016: thd-0xed6600: amandad: DEV_PREFIX="/dev/"
RDEV_PREFIX="/dev/r"
Mon Feb 22 08:45:04 2016: thd-0xed6600: amandad: DUMP="/sbin/dump"
RESTORE="/sbin/restore" VDUMP=UNDEF
Mon Feb 22 08:45:04 2016: thd-0xed6600: amandad: VRESTORE=UNDEF
XFSDUMP="/sbin/xfsdump"
Mon Feb 22 08:45:04 2016: thd-0xed6600: amandad:
XFSRESTORE="/sbin/xfsrestore" VXDUMP=UNDEF VXRESTORE=UNDEF
Mon Feb 22 08:45:04 2016: thd-0xed6600: amandad:
SAMBA_CLIENT="/usr/bin/smbclient" GNUTAR="/bin/tar"
Mon Feb 22 08:45:04 2016: thd-0xed6600: amandad:
COMPRESS_PATH="/bin/gzip" UNCOMPRESS_PATH="/bin/gzip"
Mon Feb 22 08:45:04 2016: thd-0xed6600: amandad: LPRCMD=UNDEF MAILER=UNDEF
Mon Feb 22 08:45:04 2016: thd-0xed6600: amandad:
listed_incr_dir="/var/lib/amanda/gnutar-lists"
Mon Feb 22 08:45:04 2016: thd-0xed6600: amandad: defs:
DEFAULT_SERVER="localhost" DEFAULT_CONFIG="DailySet1"
Mon Feb 22 08:45:04 2016: thd-0xed6600: amandad:
DEFAULT_TAPE_SERVER="localhost" DEFAULT_TAPE_DEVICE=""
Mon Feb 22 08:45:04 2016: thd-0xed6600: amandad: NEED_STRSTR
AMFLOCK_POSIX AMFLOCK_FLOCK AMFLOCK_LOCKF
Mon Feb 22 08:45:04 2016: thd-0xed6600: amandad: AMFLOCK_LNLOCK
SETPGRP_VOID AMANDA_DEBUG_DAYS=4 BSD_SECURITY
Mon Feb 22 08:45:04 2016: thd-0xed6600: amandad: USE_AMANDAHOSTS
CLIENT_LOGIN="backup" CHECK_USERID HAVE_GZIP
Mon Feb 22 08:45:04 2016: thd-0xed6600: amandad: COMPRESS_SUFFIX=".gz"
COMPRESS_FAST_OPT="--fast"
Mon Feb 22 08:45:04 2016: thd-0xed6600: amandad:
COMPRESS_BEST_OPT="--best" UNCOMPRESS_OPT="-dc"
Mon Feb 22 08:45:04 2016: thd-0xed6600: amandad:
security_handleinit(handle=0xee18f0, driver=0x7f5a725c99c0 (BSDTCP))
Mon Feb 22 08:45:04 2016: thd-0xed6600: amandad:
security_streaminit(stream=0xee1ad0, driver=0x7f5a725c99c0 (BSDTCP))
Mon Feb 22 08:45:04 2016: thd-0xed6600: amandad: authenticated peer name
is 'croaker.c.c'
Mon Feb 22 08:45:04 2016: thd-0xed6600: amandad: accept recv REQ pkt:
<<<<<
SERVICE noop
OPTIONS features=ffffffff9efefbffffffffff1f;
Mon Feb 22 08:45:04 2016: thd-0xed6600: amandad: creating new service: noop
OPTIONS features=ffffffff9efefbffffffffff1f;
Mon Feb 22 08:45:04 2016: thd-0xed6600: amandad: sending ACK pkt:
<<<<<
Mon Feb 22 08:45:04 2016: thd-0xed6600: amandad: tcpm_send_token: data
is still flowing
Mon Feb 22 08:45:05 2016: thd-0xed6600: amandad: sending REP pkt:
<<<<<
OPTIONS features=ffffffff9efefbffffffffff1f;
Mon Feb 22 08:45:15 2016: thd-0xed6600: amandad: timeout
Mon Feb 22 08:45:15 2016: thd-0xed6600: amandad: sending REP pkt:
<<<<<
OPTIONS features=ffffffff9efefbffffffffff1f;
Mon Feb 22 08:45:15 2016: thd-0xed6600: amandad: tcpm_send_token: data
is still flowing
Mon Feb 22 08:45:25 2016: thd-0xed6600: amandad: timeout
Mon Feb 22 08:45:25 2016: thd-0xed6600: amandad: sending REP pkt:
<<<<<
OPTIONS features=ffffffff9efefbffffffffff1f;
Mon Feb 22 08:45:35 2016: thd-0xed6600: amandad: timeout
Mon Feb 22 08:45:35 2016: thd-0xed6600: amandad: sending REP pkt:
<<<<<
OPTIONS features=ffffffff9efefbffffffffff1f;
Mon Feb 22 08:45:35 2016: thd-0xed6600: amandad: tcpm_send_token: data
is still flowing
Mon Feb 22 08:45:45 2016: thd-0xed6600: amandad: timeout
Mon Feb 22 08:45:45 2016: thd-0xed6600: amandad: sending REP pkt:
<<<<<
OPTIONS features=ffffffff9efefbffffffffff1f;
Mon Feb 22 08:45:55 2016: thd-0xed6600: amandad: timeout
Mon Feb 22 08:45:55 2016: thd-0xed6600: amandad: timeout waiting for ACK
for our REP
Mon Feb 22 08:45:55 2016: thd-0xed6600: amandad:
security_close(handle=0xee18f0, driver=0x7f5a725c99c0 (BSDTCP))
Mon Feb 22 08:45:55 2016: thd-0xed6600: amandad:
security_stream_close(0xee1ad0)
Mon Feb 22 08:45:55 2016: thd-0xed6600: amandad: tcpm_send_token: data
is still flowing
Mon Feb 22 08:46:04 2016: thd-0xed6600: amandad: timeout exit
Mon Feb 22 08:46:04 2016: thd-0xed6600: amandad: pid 21287 finish time
Mon Feb 22 08:46:04 2016
Thanks,
--
Bill Carlson
Anything is possible, given Time and Money.