On Wednesday 09 May 2007, Steven Settlemyre wrote: >Can someone please help me? > >Steven Settlemyre wrote: >> I haven't changed my configs for months and things were running great >> until last week. Since last tues, none of my dailies have finished, >> and last night a monthly failed. >> >> Looking through the logs I see the problem always seems to start with >> "data write: Connection reset by peer" and "Don't know how to send >> ABORT command to chunker". I'm having a hard time interpreting the >> logs and can't seem to find too much in the archives about this. Was >> wondering if someone could walk me through an explanation of the >> problem and how to avoid it in the future. >> >> My monthlies run tape spanning on 3 40G tapes. >> >> Here is the email output generated: >> >> *** THE DUMPS DID NOT FINISH PROPERLY! >> >> These dumps were to tape Monthly21. >> The next 3 tapes Amanda expects to use are: Monthly01, Monthly02, >> Monthly03. >> The next 3 new tapes already labelled are: Monthly19, Monthly20, >> Monthly22. >> >> FAILURE AND STRANGE DUMP SUMMARY: >> wagstaff /usr/local lev 1 FAILED [data >> write: Connection reset by peer] >> lollipop /files1 lev 0 FAILED [data >> write: Connection reset by peer] >> helios /files3 lev 1 FAILED [data write: >> Connection reset by peer] >> helios / RESULTS MISSING >> helios /files2 RESULTS MISSING >> helios /usr RESULTS MISSING >> helios /usr/local RESULTS MISSING >> helios /var RESULTS MISSING >> lollipop / RESULTS MISSING >> lollipop /usr RESULTS MISSING >> lollipop /usr/local RESULTS MISSING >> wagstaff /files3 RESULTS MISSING >> wagstaff /files4 RESULTS MISSING >> wagstaff /files5 RESULTS MISSING >> wagstaff /files6/vol/Voiceware RESULTS MISSING >> wizard /files2 RESULTS MISSING >> snapserver /hd/vol_mnt0/shares/TermLab RESULTS MISSING >> snapserver /hd/vol_mnt0/shares/bcl RESULTS MISSING >> snapserver /hd/vol_mnt0/shares/biochem RESULTS MISSING >> snapserver /hd/vol_mnt0/shares/confocal RESULTS MISSING >> driver: FATAL Don't know how to send ABORT command to chunker >> chunker: FATAL error [bad command after RQ-MORE-DISK: "QUIT"] >> chunker: FATAL error [bad command after RQ-MORE-DISK: "QUIT"] >> chunker: FATAL error [bad command after RQ-MORE-DISK: "QUIT"] >> >> >> STATISTICS: >> Total Full Incr. >> -------- -------- -------- >> Estimate Time (hrs:min) 0:08 >> Run Time (hrs:min) 1:01 >> Dump Time (hrs:min) 1:55 1:40 0:16 >> Output Size (meg) 8519.7 7729.7 790.1 >> Original Size (meg) 13146.3 11595.5 1550.8 >> Avg Compressed Size (%) 64.8 66.7 50.9 (level:#disks >> ...) >> Filesystems Dumped 35 12 23 (1:23) >> Avg Dump Rate (k/s) 1261.0 1323.3 863.1 >> >> Tape Time (hrs:min) 0:53 0:44 0:09 >> Tape Size (meg) 8521.6 7730.3 791.3 >> Tape Used (%) 21.1 19.0 2.1 (level:#disks >> ...) >> Filesystems Taped 35 12 23 (1:23) >> (level:#chunks ...) >> Chunks Taped 35 12 23 (1:23) >> Avg Tp Write Rate (k/s) 2724.3 3000.8 1433.6 >> >> USAGE BY TAPE: >> Label Time Size % Nb Nc >> Monthly21 0:53 8726112k 21.1 35 35 >> >> >> FAILED AND STRANGE DUMP DETAILS: >> >> /-- wagstaff /usr/local lev 1 FAILED [data write: Connection reset by >> peer] >> sendbackup: start [wagstaff level 1] >> sendbackup: info BACKUP=/usr/sbin/ufsdump >> sendbackup: info RECOVER_CMD=/usr/local/bin/gzip -dc >> >> |/usr/sbin/ufsrestore -f... - >> >> sendbackup: info COMPRESS_SUFFIX=.gz >> sendbackup: info end >> >> | DUMP: Writing 32 Kilobyte records >> | DUMP: Date of this level 1 dump: Tue May 08 01:11:26 2007 >> | DUMP: Date of last level 0 dump: Mon Apr 30 23:54:14 2007 >> | DUMP: Dumping /dev/rdsk/c0t0d0s7 (wagstaff:/usr/local) to standard >> >> output. >> >> | DUMP: Mapping (Pass I) [regular files] >> | DUMP: Mapping (Pass II) [directories] >> | DUMP: Mapping (Pass II) [directories] >> | DUMP: Mapping (Pass II) [directories] >> | DUMP: Estimated 13585968 blocks (6633.77MB) on 0.10 tapes. >> | DUMP: Dumping (Pass III) [directories] >> | DUMP: Dumping (Pass IV) [regular files] >> | DUMP: 16.49% done, finished in 0:50 >> | DUMP: 28.34% done, finished in 0:57 >> | DUMP: 38.89% done, finished in 1:12 >> >> \-------- >> >> /-- lollipop /files1 lev 0 FAILED [data write: Connection reset by peer] >> sendbackup: start [lollipop level 0] >> sendbackup: info BACKUP=/usr/sbin/ufsdump >> sendbackup: info RECOVER_CMD=/usr/bin/gzip -dc |/usr/sbin/ufsrestore >> -f... - >> sendbackup: info COMPRESS_SUFFIX=.gz >> sendbackup: info end >> >> | DUMP: Writing 32 Kilobyte records >> | DUMP: Date of this level 0 dump: Mon May 07 23:52:56 2007 >> | DUMP: Date of last level 0 dump: the epoch >> | DUMP: Dumping /dev/rdsk/c0t2d0s2 (lollipop:/files1) to standard >> >> output. >> >> | DUMP: Mapping (Pass I) [regular files] >> | DUMP: Mapping (Pass II) [directories] >> | DUMP: Estimated 34371404 blocks (16782.91MB) on 0.25 tapes. >> | DUMP: Dumping (Pass III) [directories] >> | DUMP: Dumping (Pass IV) [regular files] >> | DUMP: 6.08% done, finished in 2:34 >> | DUMP: 13.06% done, finished in 2:13 >> | DUMP: 19.93% done, finished in 2:00 >> | DUMP: 26.85% done, finished in 1:49 >> | DUMP: 33.93% done, finished in 1:37 >> | DUMP: 41.38% done, finished in 1:25 >> | DUMP: 49.81% done, finished in 1:10 >> | DUMP: 57.36% done, finished in 0:59 >> | DUMP: 60.37% done, finished in 0:59 >> | DUMP: 64.71% done, finished in 0:54 >> | DUMP: 72.54% done, finished in 0:41 >> | DUMP: 78.46% done, finished in 0:32 >> | DUMP: 84.45% done, finished in 0:23 >> | DUMP: 89.91% done, finished in 0:15 >> | DUMP: 96.51% done, finished in 0:05 >> >> \-------- >> >> /-- helios /files3 lev 1 FAILED [data write: Connection reset by peer] >> sendbackup: start [helios level 1] >> sendbackup: info BACKUP=/bin/tar >> sendbackup: info RECOVER_CMD=/bin/gzip -dc |/bin/tar -f... - >> sendbackup: info COMPRESS_SUFFIX=.gz >> sendbackup: info end >> \-------- >> >> >> NOTES: >> planner: wagstaff /files6/vol/speech7 20070507 0 [dumps too big, 9328 >> KB, full dump delayed] >> planner: oz /files1 20070507 0 [dumps too big, 11716 KB, full dump >> delayed] >> planner: wagstaff /files6/vol/bvd 20070507 0 [dumps too big, 12856 >> KB, full dump delayed] >> planner: oz / 20070507 0 [dumps too big, 21760 KB, full dump delayed] >> planner: wagstaff / 20070507 0 [dumps too big, 23105 KB, full dump >> delayed] >> planner: helios / 20070507 0 [dumps too big, 38583 KB, full dump >> delayed] >> planner: wizard / 20070507 0 [dumps too big, 44165 KB, full dump >> delayed] >> planner: snapserver /hd/vol_mnt0/shares/surgery 20070507 0 [dumps too >> big, 91538 KB, full dump delayed] >> planner: wagstaff /usr 20070507 0 [dumps too big, 112839 KB, full >> dump delayed] >> planner: lollipop /usr 20070507 0 [dumps too big, 131680 KB, full >> dump delayed] >> planner: wizard /usr 20070507 0 [dumps too big, 162844 KB, full dump >> delayed] >> planner: martin /var 20070507 0 [dumps too big, 178985 KB, full dump >> delayed] >> planner: wizard /var 20070507 0 [dumps too big, 210323 KB, full dump >> delayed] >> planner: professor /usr/local 20070507 0 [dumps too big, 219910 KB, >> full dump delayed] >> planner: snapserver /hd/vol_mnt0/shares/NeuroGen 20070507 0 [dumps >> too big, 247911 KB, full dump delayed] >> planner: wagstaff /files6/vol/spdata1 20070507 0 [dumps too big, >> 358932 KB, full dump delayed] >> planner: lollipop /usr/local 20070507 0 [dumps too big, 436518 KB, >> full dump delayed] >> planner: wizard /usr/local 20070507 0 [dumps too big, 458028 KB, full >> dump delayed] >> planner: snapserver /hd/vol_mnt0/shares/immuno 20070507 0 [dumps too >> big, 494376 KB, full dump delayed] >> planner: oz /usr 20070507 0 [dumps too big, 520179 KB, full dump >> delayed] >> planner: wagstaff /files6/vol/spdata2 20070507 0 [dumps too big, >> 522063 KB, full dump delayed] >> planner: snapserver /hd/vol_mnt0/shares/MID 20070507 0 [dumps too >> big, 522216 KB, full dump delayed] >> planner: professor /usr 20070507 0 [dumps too big, 565153 KB, full >> dump delayed] >> planner: helios /usr 20070507 0 [dumps too big, 639655 KB, full dump >> delayed] >> planner: helios /var 20070507 0 [dumps too big, 1609562 KB, full dump >> delayed] >> planner: helios /usr/local 20070507 0 [dumps too big, 1930526 KB, >> full dump delayed] >> planner: helios /files2 20070507 0 [dumps too big, 3326170 KB, full >> dump delayed] >> planner: snapserver /hd/vol_mnt0/shares/BioInf 20070507 0 [dumps too >> big, 4663399 KB, full dump delayed] >> planner: wagstaff /usr/local 20070507 0 [dumps too big, 5087313 KB, >> full dump delayed] >> planner: snapserver /hd/vol_mnt0/shares/urology 20070507 0 [dumps too >> big, 5786512 KB, full dump delayed] >> planner: helios /files3 20070507 0 [dumps too big, 8327643 KB, full >> dump delayed] >> planner: wagstaff /files1 20070507 0 [dumps too big, 11203120 KB, >> full dump delayed] >> taper: tape Monthly21 kb 8726112 fm 35 [OK] >> >> >> DUMP SUMMARY: >> DUMPER STATS >> TAPER STATS >> HOSTNAME DISK L ORIG-kB OUT-kB COMP% MMM:SS KB/s >> MMM:SS KB/s >> -------------------------- ------------------------------------------- >> ------------- >> helios / MISSING >> ------------------------------------------------- >> helios /files2 MISSING >> ------------------------------------------------- >> helios /files3 1 FAILED >> -------------------------------------------------- >> helios /usr MISSING >> ------------------------------------------------- >> helios /usr/local MISSING >> ------------------------------------------------- >> helios /var MISSING >> ------------------------------------------------- >> lollipop / MISSING >> ------------------------------------------------- >> lollipop /files1 0 FAILED >> -------------------------------------------------- >> lollipop /usr MISSING >> ------------------------------------------------- >> lollipop /usr/local MISSING >> ------------------------------------------------- >> martin /var 1 33930 15808 46.6 0:03 5776.6 >> 0:04 4107.7 >> oz / 1 290 96 33.1 0:00 404.7 0:01 >> 83.3 >> oz /files1 1 3640 1344 36.9 0:00 4641.9 0:04 >> 304.1 >> oz /usr 1 60180 18432 30.6 0:05 3425.3 0:10 >> 1822.2 >> oz /var 0 796760 339968 42.7 1:34 3632.0 3:12 >> 1771.7 >> professor.as / 0 104980 46016 43.8 0:22 >> 2099.4 0:09 4941.8 >> professor.as /usr 1 6590 640 9.7 0:01 >> 449.2 0:01 534.7 >> professor.as /usr/local 1 1640 192 11.7 0:00 >> 422.0 0:09 21.8 >> snapserver -res/BioInf 1 2380 416 17.5 0:13 27.5 >> 0:05 90.0 >> snapserver -shares/MID 1 310 96 31.0 0:04 11.8 >> 0:01 83.7 >> snapserver -es/MolGene 0 10 64 640.0 0:00 32.8 >> 0:03 23.2 >> snapserver -s/NeuroGen 1 80 64 80.0 0:01 10.5 >> 0:01 56.0 >> snapserver -es/TermLab MISSING >> ------------------------------------------------- >> snapserver -ares/admin 0 10 64 640.0 0:00 38.7 >> 0:01 45.1 >> snapserver -shares/bcl MISSING >> ------------------------------------------------- >> snapserver -es/biochem MISSING >> ------------------------------------------------- >> snapserver -s/confocal MISSING >> ------------------------------------------------- >> snapserver -ares/histo 0 10 64 640.0 0:00 31.3 >> 0:01 58.4 >> snapserver -res/immuno 1 3400 416 12.2 0:04 87.6 >> 0:01 292.0 >> snapserver -ares/mcard 0 10 64 640.0 0:00 36.2 >> 0:01 54.6 >> snapserver -ares/mysql 0 10 64 640.0 0:00 41.4 >> 0:02 26.7 >> snapserver -s/neurosci 0 4421720 2413152 54.6 12:45 3155.2 >> 10:18 3905.8 >> snapserver -es/surgery 1 10 64 640.0 0:00 39.4 >> 0:07 8.6 >> snapserver -s/sysadmin 0 2010 2048 101.9 0:00 6036.8 >> 0:03 586.2 >> snapserver -es/urology 1 160880 113472 70.5 0:19 5824.1 >> 0:50 2275.4 >> wagstaff / 1 95 64 67.4 0:04 2.9 >> 0:01 56.7 >> wagstaff /files1 1 824351 564832 68.5 8:59 1048.6 >> 3:52 2436.2 >> wagstaff /files3 MISSING >> ------------------------------------------------- >> wagstaff /files4 MISSING >> ------------------------------------------------- >> wagstaff /files5 MISSING >> ------------------------------------------------- >> wagstaff -/Voiceware MISSING >> ------------------------------------------------- >> wagstaff -s6/vol/bvd 1 10 64 640.0 0:00 4.5 >> 0:01 56.7 >> wagstaff -ol/spdata1 1 1890 416 22.0 0:09 42.3 >> 0:01 359.9 >> wagstaff -ol/spdata2 1 480 160 33.3 0:02 50.8 >> 0:04 36.5 >> wagstaff -ol/speech7 1 40 64 160.0 0:00 14.4 >> 0:01 57.9 >> wagstaff /usr 1 31 64 206.5 0:05 0.2 >> 0:01 55.9 >> wagstaff /usr/local 1 FAILED >> -------------------------------------------------- >> wizard / 1 95 64 67.4 0:04 2.9 >> 0:02 35.1 >> wizard /files1 0 4576895 4086720 89.3 63:58 1064.8 >> 14:28 4706.1 >> wizard /files2 MISSING >> ------------------------------------------------- >> wizard /usr 1 63 64 101.6 0:06 0.2 >> 0:01 58.0 >> wizard /usr/local 1 369407 21600 5.8 2:39 135.8 >> 2:23 151.2 >> wizard /var 1 118239 71840 60.8 2:38 454.1 >> 1:22 872.8 >> wizard /var/log 0 110271 72032 65.3 2:10 554.3 >> 11:28 104.7 >> wizard /var/mail 0 1861087 955584 51.3 18:53 843.7 >> 4:10 3820.7 >> >> (brought to you by Amanda version 2.5.0p2)
This version is getting a wee bit long in the tooth but I think I'd look in the logs for timeout messages.. One of the reasons for a lack of reply might be that no one is ATM, running a duplicate of your config as deduced from the limited info posted. More details might help. -- Cheers, Gene "There are four boxes to be used in defense of liberty: soap, ballot, jury, and ammo. Please use in that order." -Ed Howdershelt (Author) You have all the characteristics of a popular politician: a horrible voice, bad breeding, and a vulgar manner. -- Aristophanes