Re: I had enough holding disk but part of the dump failed -

2000-11-04 Thread Jean-Louis Martineau

On Fri, Nov 03, 2000 at 11:23:10PM -0500, John R. Jackson wrote:
 I think that's what's happening, a dump with est_kps larger than the
 bandwidth will never be scheduled to dump on holding disk.
 
 I agree it looks like that could happen in principle (in fact, I wonder
 if other such "don't do this now" states should be examined), but in this
 particular case the estimated bandwidth for the two disks not processed
 was well below the available.
 

The schedule give:

 admin1.cor sda9 lv 2 t 5 s52939 p 13


52939/5 = 10587  that's above 1, and that's why sda9 didn't get dumped.

As Alexandre said, sda10 didn't get dumped because the disk was big and
the chunksize set to -1.

Jean-Louis
-- 
Jean-Louis Martineau email: [EMAIL PROTECTED] 
Departement IRO, Universite de Montreal
C.P. 6128, Succ. CENTRE-VILLETel: (514) 343-6111 ext. 3529
Montreal, Canada, H3C 3J7Fax: (514) 343-5834



Re: I had enough holding disk but part of the dump failed -

2000-11-04 Thread Jean-Louis Martineau

On Sat, Nov 04, 2000 at 09:35:54AM -0500, Jean-Louis Martineau wrote:
 On Fri, Nov 03, 2000 at 11:23:10PM -0500, John R. Jackson wrote:
  I think that's what's happening, a dump with est_kps larger than the
  bandwidth will never be scheduled to dump on holding disk.
  
  I agree it looks like that could happen in principle (in fact, I wonder
  if other such "don't do this now" states should be examined), but in this
  particular case the estimated bandwidth for the two disks not processed
  was well below the available.
  
 
 The schedule give:
 
  admin1.cor sda9 lv 2 t 5 s52939 p 13
 
 
 52939/5 = 10587  that's above 1, and that's why sda9 didn't get dumped.

In free_kps()  we have:
/* XXX - kludge - if we are currently using nothing
**   on this interface then lie and say he can
**   have as much as he likes.
*/
if (ip-curusage == 0) res = 1;
else res = ip-maxusage - ip-curusage;

Even if maxusage  1, it will never return more than 1 if curusage==0.

Should we fix it and increased it to 10?
Or my previous patch is better and we should remove this kludge?

Jean-Louis
-- 
Jean-Louis Martineau email: [EMAIL PROTECTED] 
Departement IRO, Universite de Montreal
C.P. 6128, Succ. CENTRE-VILLETel: (514) 343-6111 ext. 3529
Montreal, Canada, H3C 3J7Fax: (514) 343-5834



Re: I had enough holding disk but part of the dump failed -

2000-11-03 Thread Scot W. Hetzel

From: "Denise Ives" [EMAIL PROTECTED]
 claiming there was [no more holding disk space] -

 What do you think went wrong?

 Filesystemkbytesused   avail capacity  Mounted on
 /dev/md/dsk/d20  11560144 1847220 959732317%/dump

 Subject: daily AMANDA MAIL REPORT FOR November 3, 2000



 FAILURE AND STRANGE DUMP SUMMARY:
   admin1.cor sda9 lev 2 FAILED [no more holding disk space]
   admin1.cor sda10 lev 1 FAILED [no more holding disk space]


Your amanda.conf file may be specifying a maximum amount of drive space to
use for the holding disk.

holdingdisk hd1 {
comment "main holding disk"
directory "/dumps/amanda"   # where the holding disk is
use -20 Mb  # how much space can we use on it
# a negative value mean:
#use all space except that value
chunksize 1 Gb  # size of chunk if you want big dump to be
# dumped on multiple files on holding disks
#  N Kb/Mb/Gb split disks in chunks of size N
#  0  split disks in INT_MAX/1024 Kb chunks
# -1  same as -INT_MAX/1024 (see below)
# -N Kb/Mb/Gb dont split, dump larger
# filesystems directly to tape
# (example: -2 Gb)
# chunksize 2 Gb
}

Try, changing the "use" size to a more appropriate valuse for your setup.

Scot




Re: I had enough holding disk but part of the dump failed -

2000-11-03 Thread Denise Ives

Here is my amanda.conf file with respect to the holding disk - 


holdingdisk hd1 {
comment "main holding disk"
directory "/dump/amanda"# where the holding disk is
use -500Mb  # how much space can we use on it
# a negative value mean:
#use all space except that value
chunksize -1# size of chunk if you want big dump to be
# dumped on multiple files on holding disks
#  N Kb/Mb/Gb split disks in chunks of size N
#  0  split disks in INT_MAX/1024 Kb
chunks
# -1  same as -INT_MAX/1024 (see below)
# -N Kb/Mb/Gb dont split, dump larger
# filesystems directly to tape
# (example: -2 Gb)



Is the use wrong?

**


On Fri, 3 Nov 2000, Scot W. Hetzel wrote:

 From: "Denise Ives" [EMAIL PROTECTED]
  claiming there was [no more holding disk space] -
 
  What do you think went wrong?
 
  Filesystemkbytesused   avail capacity  Mounted on
  /dev/md/dsk/d20  11560144 1847220 959732317%/dump
 
  Subject: daily AMANDA MAIL REPORT FOR November 3, 2000
 
 
 
  FAILURE AND STRANGE DUMP SUMMARY:
admin1.cor sda9 lev 2 FAILED [no more holding disk space]
admin1.cor sda10 lev 1 FAILED [no more holding disk space]
 
 
 Your amanda.conf file may be specifying a maximum amount of drive space to
 use for the holding disk.
 
 holdingdisk hd1 {
 comment "main holding disk"
 directory "/dumps/amanda"   # where the holding disk is
 use -20 Mb  # how much space can we use on it
 # a negative value mean:
 #use all space except that value
 chunksize 1 Gb  # size of chunk if you want big dump to be
 # dumped on multiple files on holding disks
 #  N Kb/Mb/Gb split disks in chunks of size N
 #  0  split disks in INT_MAX/1024 Kb chunks
 # -1  same as -INT_MAX/1024 (see below)
 # -N Kb/Mb/Gb dont split, dump larger
 # filesystems directly to tape
 # (example: -2 Gb)
 # chunksize 2 Gb
 }
 
 Try, changing the "use" size to a more appropriate valuse for your setup.
 
 Scot
 





Re: I had enough holding disk but part of the dump failed -

2000-11-03 Thread John R. Jackson

Here is my amanda.conf file with respect to the holding disk - 
...
use -500Mb ...

That's OK.

chunksize -1 ...

This is not related to your problem, but you should change this at
some point to something like "1000 Mb".

Is the use wrong?

It might be a little large (why not let Amanda have almost all of that
file system?), but is probably not the problem.  Much as I hate to do
this :-), please post the entire amdump.1 file that goes with this run.
It's the only way we'll be able to tell what was going on.

John R. Jackson, Technical Software Specialist, [EMAIL PROTECTED]



Re: I had enough holding disk but part of the dump failed -

2000-11-03 Thread Denise Ives

On Fri, 3 Nov 2000, John R. Jackson wrote:

 Here is my amanda.conf file with respect to the holding disk - 
 ...
 use -500Mb ...
 
 That's OK.
 
 chunksize -1 ...
 
 This is not related to your problem, but you should change this at
 some point to something like "1000 Mb".
 
 Is the use wrong?
 
 It might be a little large (why not let Amanda have almost all of that
 file system?), but is probably not the problem.  Much as I hate to do
 this :-), please post the entire amdump.1 file that goes with this run.
 It's the only way we'll be able to tell what was going on.
 
 John R. Jackson, Technical Software Specialist, [EMAIL PROTECTED]
 

-- 
Denise E. Ives  [EMAIL PROTECTED]
Systems Engineer734.822.2037

Multilingual Internet Domain Name Registrations - http://www.walid.com


amdump: start at Fri Nov  3 01:15:00 EST 2000
planner: pid 14161 executable /usr/local/pkg/amanda-2.4.1p1/libexec/planner version 
2.4.1p1
planner: build: VERSION="Amanda-2.4.1p1"
planner:BUILT_DATE="Tue Oct 17 14:52:15 GMT 2000"
planner:BUILT_MACH="SunOS sundev1.corp.walid.com 5.8 Generic sun4u sparc 
SUNW,Ultra-60"
planner:CC="gcc"
planner: paths: bindir="/usr/local/pkg/amanda-2.4.1p1/bin"
planner:sbindir="/usr/local/pkg/amanda-2.4.1p1/sbin"
planner:libexecdir="/usr/local/pkg/amanda-2.4.1p1/libexec"
planner:mandir="/usr/local/pkg/amanda-2.4.1p1/man"
planner:CONFIG_DIR="/usr/local/pkg/amanda-2.4.1p1/etc/amanda"
planner:DEV_PREFIX="/dev/dsk/" RDEV_PREFIX="/dev/rdsk/"
planner:DUMP="/usr/sbin/ufsdump" RESTORE="/usr/sbin/ufsrestore"
planner:GNUTAR="/usr/local/bin/tar" COMPRESS_PATH="/usr/bin/gzip"
planner:UNCOMPRESS_PATH="/usr/bin/gzip" MAILER="/usr/ucb/Mail"
planner:
listed_incr_dir="/usr/local/pkg/amanda-2.4.1p1/var/amanda/gnutar-lists"
planner: defs:  DEFAULT_SERVER="sundev1.corp.walid.com"
planner:DEFAULT_CONFIG="daily"
planner:DEFAULT_TAPE_SERVER="sundev1.corp.walid.com"
planner:DEFAULT_TAPE_DEVICE="/dev/rmt/0bn" HAVE_MMAP HAVE_SYSVSHM
planner:LOCKING=POSIX_FCNTL SETPGRP_VOID DEBUG_CODE BSD_SECURITY
planner:USE_AMANDAHOSTS CLIENT_LOGIN="amanda" FORCE_USERID HAVE_GZIP
planner:COMPRESS_SUFFIX=".gz" COMPRESS_FAST_OPT="--fast"
planner:COMPRESS_BEST_OPT="--best" UNCOMPRESS_OPT="-dc"
READING CONF FILES...
startup took 0.025 secs

SETTING UP FOR ESTIMATES...
setting up estimates for sundev1.corp.walid.com:c0t0d0s0
sundev1.corp.walid.com:c0t0d0s0 overdue 11 days for level 0
setup_estimate: sundev1.corp.walid.com:c0t0d0s0: command 0, options:
last_level 1 next_level0 -11 level_days 2
getting estimates 0 (2049855) 1 (255) 2 (0)
setting up estimates for sundev1.corp.walid.com:c0t0d0s3
sundev1.corp.walid.com:c0t0d0s3 overdue 11 days for level 0
setup_estimate: sundev1.corp.walid.com:c0t0d0s3: command 0, options:
last_level 2 next_level0 -11 level_days 1
getting estimates 0 (481183) 2 (52991) 3 (0)
setting up estimates for sundev1.corp.walid.com:c0t0d0s7
sundev1.corp.walid.com:c0t0d0s7 overdue 11 days for level 0
setup_estimate: sundev1.corp.walid.com:c0t0d0s7: command 0, options:
last_level 0 next_level0 -11 level_days 0
getting estimates 0 (5101407) 1 (737663) -1 (-1)
setting up estimates for admin1.corp.walid.com:sda6
admin1.corp.walid.com:sda6 overdue 11 days for level 0
setup_estimate: admin1.corp.walid.com:sda6: command 0, options:
last_level 1 next_level0 -11 level_days 2
getting estimates 0 (93824) 1 (490) 2 (0)
setting up estimates for admin1.corp.walid.com:sda3
driver: pid 14160 executable /usr/local/pkg/amanda-2.4.1p1/libexec/driver version 
2.4.1p1
driver: send-cmd time 0.022 to taper: START-TAPER 20001103
admin1.corp.walid.com:sda3 overdue 11 days for level 0
setup_estimate: admin1.corp.walid.com:sda3: command 0, options:
last_level 1 next_level0 -11 level_days 2
getting estimates 0 (6020) 1 (19) 2 (0)
setting up estimates for admin1.corp.walid.com:sda5
admin1.corp.walid.com:sda5 overdue 11 days for level 0
setup_estimate: admin1.corp.walid.com:sda5: command 0, options:
last_level 1 next_level0 -11 level_days 2
getting estimates 0 (1594953) 1 (3698) 2 (0)
setting up estimates for admin1.corp.walid.com:sda10
admin1.corp.walid.com:sda10 overdue 11 days for level 0
setup_estimate: admin1.corp.walid.com:sda10: command 0, options:
last_level 0 next_level0 -11 level_days 0
getting estimates 0 (4550177) 1 (0) -1 (-1)
setting up estimates for admin1.corp.walid.com:sda9
admin1.corp.walid.com:sda9 overdue 11 days for level 0
setup_estimate: admin1.corp.walid.com:sda9: command 0, options:
last_level 1 next_level0 -11 level_days 2
getting estimates 0 (58442) 1 (48454) 2 (0)
setting up estimates for admin1.corp.walid.com:sda2
taper: pid 14162 executable taper version 2.4.1p1
admin1.corp.walid.com:sda2 

Re: I had enough holding disk but part of the dump failed -

2000-11-03 Thread John R. Jackson

I think that's what's happening, a dump with est_kps larger than the
bandwidth will never be scheduled to dump on holding disk.

I agree it looks like that could happen in principle (in fact, I wonder
if other such "don't do this now" states should be examined), but in this
particular case the estimated bandwidth for the two disks not processed
was well below the available.

Jean-Louis

John R. Jackson, Technical Software Specialist, [EMAIL PROTECTED]