Slow client

2007-05-09 Thread Olivier Nicole
Hi,

I noticed yesterday that one of my clients suddenly became very slow:

by wall clock, seeing the file grows on the holding disk, it takes 9
minutes for 5 MB of GNUTAR level 2. previous run (Apr. 28th) for the
same machine, same DLE, same level was 107 KB/s for the dump according
to Amanda report.

The slowness can be seen for every DLE on that client. Other than
that, the client is working fine, load average is close to 0, it is
not swapping or anything, network seems to be working fine.

Any idea?

I will run some more diagnostics later, once the backup is finished.

Best regards,

Olivier




Re: Slow client

2007-05-09 Thread Frank Smith
Olivier Nicole wrote:
 Hi,
 
 I noticed yesterday that one of my clients suddenly became very slow:
 
 by wall clock, seeing the file grows on the holding disk, it takes 9
 minutes for 5 MB of GNUTAR level 2. previous run (Apr. 28th) for the
 same machine, same DLE, same level was 107 KB/s for the dump according
 to Amanda report.

That works out to about 9kB/s, which is pretty slow.
Do you have a duplex mismatch or other errors on the network interface
or switch port?
Are you seeing any system messages about read errors on your disks?
Have you recently changed OS/kernel/tar versions?
What does the Amanda report show for dumper time and taper time for
the DLE?

Frank

 
 The slowness can be seen for every DLE on that client. Other than
 that, the client is working fine, load average is close to 0, it is
 not swapping or anything, network seems to be working fine.
 
 Any idea?
 
 I will run some more diagnostics later, once the backup is finished.
 
 Best regards,
 
 Olivier
 
 


-- 
Frank Smith  [EMAIL PROTECTED]
Sr. Systems Administrator   Voice: 512-374-4673
Hoover's Online   Fax: 512-374-4501


Re: Slow client

2007-05-09 Thread Olivier Nicole
Frank,

 Do you have a duplex mismatch or other errors on the network interface
 or switch port?

No error visible.

A manual transfer was also very slow, a reboot of the machine solved
that.

 Are you seeing any system messages about read errors on your disks?
 Have you recently changed OS/kernel/tar versions?

Nothing changed for a good while, only I did reboot the network
equipment some time last week. I beleive the duplex did not get
negociated properly after reboot.

 What does the Amanda report show for dumper time and taper time for
 the DLE?

HOSTNAME DISKL  ORIG-KB   OUT-KB  COMP% MMM:SS   KB/s MMM:SS   KB/s
-- -- -
mailback /   04508018784   41.7  41:517.5   0:10 1870.1
mailback /home   2   187190   106368   56.8 213:068.3   0:30 3489.2
mailback /usr0 FAILED -
mailback /var043080 6752   15.7  49:432.3   0:07  956.0

That's yucky :)

OK that was only a network problem it seems.

Thanks,

Olivier


2.5.2b-20070507 build

2007-05-09 Thread Gene Heskett
Greetings;

I saw this go by during the build, does it mean anything?

Name main::include_list used only once: possible typo at amgtar line 50.
Name main::ksize used only once: possible typo at amgtar line 263.
Name main::opt_record used only once: possible typo at amgtar line 236.
Name main::message_line used only once: possible typo at amgtar line 46.
Name main::max_level used only once: possible typo at amgtar line 43.
Name main::collection used only once: possible typo at amgtar line 53.
Name main::index_xml used only once: possible typo at amgtar line 45.
Name main::group_default used only once: possible typo at amgtar line 86.
Name main::exclude_file used only once: possible typo at amgtar line 51.
Name main::record used only once: possible typo at amgtar line 48.
Name main::message_xml used only once: possible typo at amgtar line 47.
Name main::index_line used only once: possible typo at amgtar line 44.
Name main::include_file used only once: possible typo at amgtar line 49.
Name main::user_backup used only once: possible typo at amgtar line 67.
Name main::user_default used only once: possible typo at amgtar line 85.
Name main::exclude_list used only once: possible typo at amgtar line 52.

-- 
Cheers, Gene
There are four boxes to be used in defense of liberty:
 soap, ballot, jury, and ammo. Please use in that order.
-Ed Howdershelt (Author)
The universe is like a safe to which there is a combination -- but the
combination is locked up in the safe.
-- Peter DeVries


Re: 2.5.2 compilation failure on irix-6.5.x

2007-05-09 Thread Jean-Louis Martineau

Pieter Bowman wrote:

This is the case on both IRIX and OSF/1.  These compilers can't
handle:

#define vstrallocf(...) debug_vstrallocf(__FILE__,__LINE__,__VA_ARGS__)
  


I fix it in the 2.5.2 tree, but the next release will require that 
syntax, which is in the C99 standard.

Do you have a C99 compiler on IRIX and OSF/1 ?

Jean-Louis


db-based infofiles

2007-05-09 Thread dustin
In working on cleaning up the amanda codebase, I discovered that at
least some of the db-based infofile backends (--with-db=gdbm, ndbm, db,
dbm) probably don't work.  I talked with Jean-Louis Martineau, and he
mentioned that they have been deprecated for a long time in favor of the
text database.

I'd like to remove this support completely in the next release.

So: are you or anyone you know using non-text infofiles?

Dustin

-- 
Dustin J. Mitchell
Storage Software Engineer, Zmanda, Inc.
http://www.zmanda.com/


Backup on DVD-RAM

2007-05-09 Thread Sebastian Marten
Hello,
i have a working Amanda configuration which backup on DVD-RAM. (its like
writing on a hard disk)

The disks are all smaller than 4.3 GB, so a backup is no problem.

But what happens if on disk gets greater than 4.3 GB?

Can Amanda handle it, and ask for a second DVD to write the dumps on, or
 must i set a special option in the config?

Any suggestions?

regards,

Sebastian





signature.asc
Description: OpenPGP digital signature


Please help... no backups since a week ago (data write: Connection reset by peer)

2007-05-09 Thread Steven Settlemyre

Can someone please help me?

Steven Settlemyre wrote:
I haven't changed my configs for months and things were running great 
until last week. Since last tues, none of my dailies have finished, 
and last night a monthly failed.


Looking through the logs I see the problem always seems to start with 
data write: Connection reset by peer and Don't know how to send 
ABORT command to chunker. I'm having a hard time interpreting the 
logs and can't seem to find too much in the archives about this. Was 
wondering if someone could walk me through an explanation of the 
problem and how to avoid it in the future.


My monthlies run tape spanning on 3 40G tapes.

Here is the email output generated:

*** THE DUMPS DID NOT FINISH PROPERLY!

These dumps were to tape Monthly21.
The next 3 tapes Amanda expects to use are: Monthly01, Monthly02, 
Monthly03.
The next 3 new tapes already labelled are: Monthly19, Monthly20, 
Monthly22.


FAILURE AND STRANGE DUMP SUMMARY:
 wagstaff  /usr/locallev 1  FAILED [data 
write: Connection reset by peer]
 lollipop  /files1   lev 0  FAILED [data 
write: Connection reset by peer]
 helios  /files3   lev 1  FAILED [data write: 
Connection reset by peer]

 helios  / RESULTS MISSING
 helios  /files2   RESULTS MISSING
 helios  /usr  RESULTS MISSING
 helios  /usr/localRESULTS MISSING
 helios  /var  RESULTS MISSING
 lollipop  / RESULTS MISSING
 lollipop  /usr  RESULTS MISSING
 lollipop  /usr/localRESULTS MISSING
 wagstaff  /files3   RESULTS MISSING
 wagstaff  /files4   RESULTS MISSING
 wagstaff  /files5   RESULTS MISSING
 wagstaff  /files6/vol/Voiceware RESULTS MISSING
 wizard/files2   RESULTS MISSING
 snapserver  /hd/vol_mnt0/shares/TermLab   RESULTS MISSING
 snapserver  /hd/vol_mnt0/shares/bcl   RESULTS MISSING
 snapserver  /hd/vol_mnt0/shares/biochem   RESULTS MISSING
 snapserver  /hd/vol_mnt0/shares/confocal  RESULTS MISSING
 driver: FATAL Don't know how to send ABORT command to chunker
 chunker: FATAL error [bad command after RQ-MORE-DISK: QUIT]
 chunker: FATAL error [bad command after RQ-MORE-DISK: QUIT]
 chunker: FATAL error [bad command after RQ-MORE-DISK: QUIT]


STATISTICS:
 Total   Full  Incr.
         
Estimate Time (hrs:min)0:08
Run Time (hrs:min) 1:01
Dump Time (hrs:min)1:55   1:40   0:16
Output Size (meg)8519.7 7729.7  790.1
Original Size (meg) 13146.311595.5 1550.8
Avg Compressed Size (%)64.8   66.7   50.9   (level:#disks 
...)

Filesystems Dumped   35 12 23   (1:23)
Avg Dump Rate (k/s)  1261.0 1323.3  863.1

Tape Time (hrs:min)0:53   0:44   0:09
Tape Size (meg)  8521.6 7730.3  791.3
Tape Used (%)  21.1   19.02.1   (level:#disks 
...)

Filesystems Taped35 12 23   (1:23)
  (level:#chunks ...)
Chunks Taped 35 12 23   (1:23)
Avg Tp Write Rate (k/s)  2724.3 3000.8 1433.6

USAGE BY TAPE:
 Label   Time  Size  %NbNc
 Monthly21   0:53  8726112k   21.13535


FAILED AND STRANGE DUMP DETAILS:

/--  wagstaff /usr/local lev 1 FAILED [data write: Connection reset by 
peer]

sendbackup: start [wagstaff level 1]
sendbackup: info BACKUP=/usr/sbin/ufsdump
sendbackup: info RECOVER_CMD=/usr/local/bin/gzip -dc 
|/usr/sbin/ufsrestore -f... -

sendbackup: info COMPRESS_SUFFIX=.gz
sendbackup: info end
|   DUMP: Writing 32 Kilobyte records
|   DUMP: Date of this level 1 dump: Tue May 08 01:11:26 2007
|   DUMP: Date of last level 0 dump: Mon Apr 30 23:54:14 2007
|   DUMP: Dumping /dev/rdsk/c0t0d0s7 (wagstaff:/usr/local) to standard 
output.

|   DUMP: Mapping (Pass I) [regular files]
|   DUMP: Mapping (Pass II) [directories]
|   DUMP: Mapping (Pass II) [directories]
|   DUMP: Mapping (Pass II) [directories]
|   DUMP: Estimated 13585968 blocks (6633.77MB) on 0.10 tapes.
|   DUMP: Dumping (Pass III) [directories]
|   DUMP: Dumping (Pass IV) [regular files]
|   DUMP: 16.49% done, finished in 0:50
|   DUMP: 28.34% done, finished in 0:57
|   DUMP: 38.89% done, finished in 1:12
\

/--  lollipop /files1 lev 0 FAILED [data write: Connection reset by peer]
sendbackup: start [lollipop level 0]
sendbackup: info BACKUP=/usr/sbin/ufsdump
sendbackup: info RECOVER_CMD=/usr/bin/gzip -dc |/usr/sbin/ufsrestore 
-f... -

sendbackup: info COMPRESS_SUFFIX=.gz
sendbackup: info end
|   DUMP: Writing 32 Kilobyte records
|   DUMP: Date of this level 0 dump: Mon May 07 23:52:56 2007
|   

Re: Please help... no backups since a week ago (data write: Connection reset by peer)

2007-05-09 Thread Gene Heskett
On Wednesday 09 May 2007, Steven Settlemyre wrote:
Can someone please help me?

Steven Settlemyre wrote:
 I haven't changed my configs for months and things were running great
 until last week. Since last tues, none of my dailies have finished,
 and last night a monthly failed.

 Looking through the logs I see the problem always seems to start with
 data write: Connection reset by peer and Don't know how to send
 ABORT command to chunker. I'm having a hard time interpreting the
 logs and can't seem to find too much in the archives about this. Was
 wondering if someone could walk me through an explanation of the
 problem and how to avoid it in the future.

 My monthlies run tape spanning on 3 40G tapes.

 Here is the email output generated:

 *** THE DUMPS DID NOT FINISH PROPERLY!

 These dumps were to tape Monthly21.
 The next 3 tapes Amanda expects to use are: Monthly01, Monthly02,
 Monthly03.
 The next 3 new tapes already labelled are: Monthly19, Monthly20,
 Monthly22.

 FAILURE AND STRANGE DUMP SUMMARY:
  wagstaff  /usr/locallev 1  FAILED [data
 write: Connection reset by peer]
  lollipop  /files1   lev 0  FAILED [data
 write: Connection reset by peer]
  helios  /files3   lev 1  FAILED [data write:
 Connection reset by peer]
  helios  / RESULTS MISSING
  helios  /files2   RESULTS MISSING
  helios  /usr  RESULTS MISSING
  helios  /usr/localRESULTS MISSING
  helios  /var  RESULTS MISSING
  lollipop  / RESULTS MISSING
  lollipop  /usr  RESULTS MISSING
  lollipop  /usr/localRESULTS MISSING
  wagstaff  /files3   RESULTS MISSING
  wagstaff  /files4   RESULTS MISSING
  wagstaff  /files5   RESULTS MISSING
  wagstaff  /files6/vol/Voiceware RESULTS MISSING
  wizard/files2   RESULTS MISSING
  snapserver  /hd/vol_mnt0/shares/TermLab   RESULTS MISSING
  snapserver  /hd/vol_mnt0/shares/bcl   RESULTS MISSING
  snapserver  /hd/vol_mnt0/shares/biochem   RESULTS MISSING
  snapserver  /hd/vol_mnt0/shares/confocal  RESULTS MISSING
  driver: FATAL Don't know how to send ABORT command to chunker
  chunker: FATAL error [bad command after RQ-MORE-DISK: QUIT]
  chunker: FATAL error [bad command after RQ-MORE-DISK: QUIT]
  chunker: FATAL error [bad command after RQ-MORE-DISK: QUIT]


 STATISTICS:
  Total   Full  Incr.
      
 Estimate Time (hrs:min)0:08
 Run Time (hrs:min) 1:01
 Dump Time (hrs:min)1:55   1:40   0:16
 Output Size (meg)8519.7 7729.7  790.1
 Original Size (meg) 13146.311595.5 1550.8
 Avg Compressed Size (%)64.8   66.7   50.9   (level:#disks
 ...)
 Filesystems Dumped   35 12 23   (1:23)
 Avg Dump Rate (k/s)  1261.0 1323.3  863.1

 Tape Time (hrs:min)0:53   0:44   0:09
 Tape Size (meg)  8521.6 7730.3  791.3
 Tape Used (%)  21.1   19.02.1   (level:#disks
 ...)
 Filesystems Taped35 12 23   (1:23)
   (level:#chunks ...)
 Chunks Taped 35 12 23   (1:23)
 Avg Tp Write Rate (k/s)  2724.3 3000.8 1433.6

 USAGE BY TAPE:
  Label   Time  Size  %NbNc
  Monthly21   0:53  8726112k   21.13535


 FAILED AND STRANGE DUMP DETAILS:

 /--  wagstaff /usr/local lev 1 FAILED [data write: Connection reset by
 peer]
 sendbackup: start [wagstaff level 1]
 sendbackup: info BACKUP=/usr/sbin/ufsdump
 sendbackup: info RECOVER_CMD=/usr/local/bin/gzip -dc

 |/usr/sbin/ufsrestore -f... -

 sendbackup: info COMPRESS_SUFFIX=.gz
 sendbackup: info end

 |   DUMP: Writing 32 Kilobyte records
 |   DUMP: Date of this level 1 dump: Tue May 08 01:11:26 2007
 |   DUMP: Date of last level 0 dump: Mon Apr 30 23:54:14 2007
 |   DUMP: Dumping /dev/rdsk/c0t0d0s7 (wagstaff:/usr/local) to standard

 output.

 |   DUMP: Mapping (Pass I) [regular files]
 |   DUMP: Mapping (Pass II) [directories]
 |   DUMP: Mapping (Pass II) [directories]
 |   DUMP: Mapping (Pass II) [directories]
 |   DUMP: Estimated 13585968 blocks (6633.77MB) on 0.10 tapes.
 |   DUMP: Dumping (Pass III) [directories]
 |   DUMP: Dumping (Pass IV) [regular files]
 |   DUMP: 16.49% done, finished in 0:50
 |   DUMP: 28.34% done, finished in 0:57
 |   DUMP: 38.89% done, finished in 1:12

 \

 /--  lollipop /files1 lev 0 FAILED [data write: Connection reset by peer]
 sendbackup: start [lollipop level 0]
 sendbackup: info BACKUP=/usr/sbin/ufsdump
 sendbackup: info RECOVER_CMD=/usr/bin/gzip -dc |/usr/sbin/ufsrestore
 -f... -
 sendbackup: info 

Re: amanda upgrade questions

2007-05-09 Thread Guy Dallaire

Replying to myself:

It's my second attempt at trying to get any input as to how difficult it may
be to upgrade amanda. I have not received any reply on both occasions.

Are my questions too dumb ?   I would like to read some sort of upgrade
guide, but I can't find anything useful in the doc or in the amanda wiki.

2.5 seems pretty scary. I see a lot of problems on the list. Is it stable,
or is it experimental ?

Thanks

2007/5/7, Guy Dallaire [EMAIL PROTECTED]:


I understand that amanda 2.5 is compatible with amanda 2.4 clients. This
means I could upgrade my server and do the clients later ? I would like to
upgrade to 2.5.2 for security reason (ability to encrypt backups, ssh)

I compiled my server and client with restriction on the tcp and udp ports
used for ease of firewall configuration:

--with-tcpportrange=5,50100' '--with-udpportrange=850,859

Do these options still apply to the new amanda version ?

Will the new version be able to restore old tapes that were written by the
amanda 2.4 tapeserver ?

Will amrecover on a 2.4 client be able to use the 2.5 amanda server ?

Are the new encryption and ssh features easy to use ?

Thanks !



Hardware suggestion

2007-05-09 Thread Olivier Nicole
Hi,

I am considering replacement for my Amanda server, the guy has been in
production for about 6 years now and I want to be ready for the time
when the tape drive will die.

We currently use SLR100, wich is fine, very little tape failure, but
tapes are expensive, a 50 GB (native) 100 GB (compressed) tape is the
price of a 250 GB hard disk. The tape drive itself cost twice the
price of the server at that time.

So we are considering vtapes, but I would like to have hard disk
physical rotation, the same way we have tape rotation --and it is
manual changing of the tape every day.

The only answer I don't have concernes the hardware to use for disk
hot plug mounting and unmounting (on the software side,I now thatthe
disk can be disconnected from the operating system before they are
physically removed from the machine). Normal hot swap bay for hard
disk are not designed for daily use, they are designed for maintenance
only, and would break soon if I swaped the disk every day.

Could anyone advise on some disk tray solution that is stong and
reliable and will handle a lot of manipulation. idealy I wouldbe
looking at one tray bay and 5 or 6 trays for the disks.

best regards,

Olivier


Re: amanda upgrade questions

2007-05-09 Thread Jon LaBadie
On Wed, May 09, 2007 at 09:38:20PM -0400, Guy Dallaire wrote:
 Replying to myself:
 
 It's my second attempt at trying to get any input as to how difficult it may
 be to upgrade amanda. I have not received any reply on both occasions.
 
 Are my questions too dumb ?   I would like to read some sort of upgrade
 guide, but I can't find anything useful in the doc or in the amanda wiki.
 
 2.5 seems pretty scary. I see a lot of problems on the list. Is it stable,
 or is it experimental ?
 
 Thanks
 
 2007/5/7, Guy Dallaire [EMAIL PROTECTED]:
 
 I understand that amanda 2.5 is compatible with amanda 2.4 clients. This
 means I could upgrade my server and do the clients later ? I would like to
 upgrade to 2.5.2 for security reason (ability to encrypt backups, ssh)

Being compatible does not mean that the clients gain the new features
of the upgraded server.  If the client could not do encryption or ssh
before, upgrading the server will not change the client.



-- 
Jon H. LaBadie  [EMAIL PROTECTED]
 JG Computing
 4455 Province Line Road(609) 252-0159
 Princeton, NJ  08540-4322  (609) 683-7220 (fax)


Re: amanda upgrade questions

2007-05-09 Thread Anthony Ho
Hi,

It is actually quite simply to upgrade amanda. All you need to do is to
download the latest version of Amanda (for this case amanda 2.5.2),
untar it and execute the following commands to clean and recompile the
latest build.

make distclean
./configure -with-tcpportrange=5,50100 --with-udpportrange=850,859
make  make install

But becareful with amanda version 2.5.2 as it contains some bug issue
with the amlabel on the vtapes.

Regards 
Anthony

On Wed, 2007-05-09 at 21:38 -0400, Guy Dallaire wrote:
 Replying to myself:
 
 It's my second attempt at trying to get any input as to how difficult
 it may be to upgrade amanda. I have not received any reply on both
 occasions.
 
 Are my questions too dumb ?   I would like to read some sort of
 upgrade guide, but I can't find anything useful in the doc or in the
 amanda wiki. 
 
 2.5 seems pretty scary. I see a lot of problems on the list. Is it
 stable, or is it experimental ?
 
 Thanks 
 
 2007/5/7, Guy Dallaire [EMAIL PROTECTED]:
 I understand that amanda 2.5 is compatible with amanda 2.4
 clients. This means I could upgrade my server and do the
 clients later ? I would like to upgrade to 2.5.2 for security
 reason (ability to encrypt backups, ssh)
 
 I compiled my server and client with restriction on the tcp
 and udp ports used for ease of firewall configuration:
 
 --with-tcpportrange=5,50100' '--with-udpportrange=850,859 
 
 Do these options still apply to the new amanda version ? 
 
 Will the new version be able to restore old tapes that were
 written by the amanda 2.4 tapeserver ?
 
 Will amrecover on a 2.4 client be able to use the 2.5 amanda
 server ?
 
 Are the new encryption and ssh features easy to use ? 
 
 Thanks !