Regarding the planner timeouts.....

 

 

Here are the amdump logs on the server regarding the clients...

 

 

planner: time 1.154: got partial result for host
twofish.cgx.transora.com disk /home: 0 -> -2K, -1 -> -2K, -1 -> -2K

taper: using label `ON1006L3' date `20081112130548'

driver: result time 26.700 from taper: TAPER-OK

driver: state time 26.700 free kps: 90000000 space: 524288000 taper:
idle idle-dumpers: 27 qlen tapeq: 0 runq: 0 roomq: 0 wakeup: 0
driver-idle: not-idle

driver: interface-state time 26.700 if default: free 90000000

driver: hdisk-state time 26.700 hdisk 0: free 524288000 dumpers 0

planner: time 302.185: getting estimates took 302.181 secs

FAILED QUEUE:

  0: twofish.cgx.transora.com /home

DONE QUEUE: empty

 

ANALYZING ESTIMATES...

planner: FAILED twofish.cgx.transora.com /home 20081112130548 0 "[disk
/home, all estimate timed out]"

INITIAL SCHEDULE (size 64):

 

DELAYING DUMPS IF NEEDED, total_size 64, tape length 703561728 mark 0

  delay: Total size now 64.

 

PROMOTING DUMPS IF NEEDED, total_lev0 0, balanced_size 0...

planner: time 302.186: analysis took 0.000 secs

 

oldlog errors:

 

STATS driver startup time 0.182

ERROR planner Request to twofish.cgx.transora.com failed: timeout
waiting for REP

WARNING planner disk twofish.cgx.transora.com:/home, estimate of level 0
timed out.

FAIL planner twofish.cgx.transora.com /home 20081112130548 0 "[disk
/home, all estimate timed out]"

FINISH planner date 20081112130548 time 302.186

WARNING driver WARNING: got empty schedule from planner

FINISH driver date 20081112130548 time 303.225

 

 

Client logs:

 

Run tar log:

1226516749.495366: runtar: pid 27706 ruid 30063 euid 0: start at Wed Nov
12 13:05:49 2008

1226516749.495652: runtar: version 2.6.0p2

1226516749.505371: runtar: /usr/local/bin/tar version: tar (GNU tar)
1.19

 

1226516749.505857: runtar: config: Full

1226516749.516449: runtar: pid 27706 ruid 0 euid 0: rename at Wed Nov 12
13:05:49 2008

1226516749.516894: runtar: running: /usr/local/bin/tar --create --file
/dev/null --directory /home --one-file-system --listed-incremental
/usr/local/var/amanda/gnutar-lists/twofish.cgx.transora.com_home_0.new
--sparse --ignore-failed-read --totals --exclude-from
/tmp/amanda/sendsize._home.20081112130549.exclude .

1226516749.516936: runtar: pid 27706 finish time Wed Nov 12 13:05:49
2008

 

Sendsize log:

1226516749.413525: sendsize: pid 27703 ruid 30063 euid 30063: start at
Wed Nov 12 13:05:49 2008

1226516749.413851: sendsize: version 2.6.0p2

1226516749.414107: sendsize: warning: errors processing config file
"/etc/amanda/amanda-client.conf" (non-fatal)

1226516749.414762: sendsize: warning: errors processing config file
"/etc/amanda/Full/amanda-client.conf" (non-fatal)

1226516749.454464: sendsize: pid 27703 ruid 30063 euid 30063: rename at
Wed Nov 12 13:05:49 2008

1226516749.457190: sendsize: waiting for any estimate child: 1 running

1226516749.458632: sendsize: calculating for amname /home, dirname
/home, spindle -1

1226516749.458774: sendsize: getting size via gnutar for /home level 0

1226516749.461231: sendsize: Spawning
"/opt/amanda/client/libexec/amanda/runtar runtar Full /usr/local/bin/tar
--create --file /dev/null --directory /home --one-file-system
--listed-incremental
/usr/local/var/amanda/gnutar-lists/twofish.cgx.transora.com_home_0.new
--sparse --ignore-failed-read --totals --exclude-from
/tmp/amanda/sendsize._home.20081112130549.exclude ." in pipeline

 

amandad.20081112130548.debug log:

 

1226516749.378493: amandad: creating new service: sendsize

OPTIONS
features=ffffffff9ffeffffffff00;maxdumps=8;hostname=twofish.cgx.transora
.com;config=Full;

GNUTAR /home  0 1970:1:1:0:0:0 -1  OPTIONS
|;auth=BSD;index;exclude-list=/home/amanda/exclude_list;

 

1226516749.381505: amandad: sending ACK pkt:

<<<<< 

>>>>> 

1226516749.381671: amandad: dgram_send_addr(addr=26180, dgram=ff350c3c)

1226516749.381692: amandad: (sockaddr_in *)26180 = { 2, 1018, 10.1.1.104
}

1226516749.381708: amandad: dgram_send_addr: ff350c3c->socket = 0

1226516749.414819: amandad: sending PREP pkt:

<<<<< 

OPTIONS features=ffffffff9ffeffffffff00;

>>>>> 

1226516749.414857: amandad: dgram_send_addr(addr=26180, dgram=ff350c3c)

1226516749.414874: amandad: (sockaddr_in *)26180 = { 2, 1018, 10.1.1.104
}

1226516749.414912: amandad: dgram_send_addr: ff350c3c->socket = 0

1226516850.419333: amandad: dgram_recv(dgram=ff350c3c, timeout=0,
fromaddr=ff360c28)

1226516850.419470: amandad: (sockaddr_in *)ff360c28 = { 2, 1018,
10.1.1.104 }

1226516850.420038: amandad: received REQ pkt:

<<<<< 

SERVICE sendsize

OPTIONS
features=ffffffff9ffeffffffff00;maxdumps=8;hostname=twofish.cgx.transora
.com;config=Full;

GNUTAR /home  0 1970:1:1:0:0:0 -1  OPTIONS
|;auth=BSD;index;exclude-list=/home/amanda/exclude_list;

>>>>> 

1226516850.420071: amandad: received dup P_REQ packet, ACKing it

1226516850.420086: amandad: sending ACK pkt:

<<<<< 

>>>>> 

1226516850.420108: amandad: dgram_send_addr(addr=26180, dgram=ff350c3c)

1226516850.420122: amandad: (sockaddr_in *)26180 = { 2, 1018, 10.1.1.104
}

1226516850.420135: amandad: dgram_send_addr: ff350c3c->socket = 0

1226516950.427759: amandad: dgram_recv(dgram=ff350c3c, timeout=0,
fromaddr=ff360c28)

1226516950.427881: amandad: (sockaddr_in *)ff360c28 = { 2, 1018,
10.1.1.104 }

1226516950.428320: amandad: received REQ pkt:

<<<<< 

SERVICE sendsize

OPTIONS
features=ffffffff9ffeffffffff00;maxdumps=8;hostname=twofish.cgx.transora
.com;config=Full;

GNUTAR /home  0 1970:1:1:0:0:0 -1  OPTIONS
|;auth=BSD;index;exclude-list=/home/amanda/exclude_list;

>>>>> 

1226516950.428345: amandad: received dup P_REQ packet, ACKing it

1226516950.428361: amandad: sending ACK pkt:

<<<<< 

>>>>> 

1226516950.428383: amandad: dgram_send_addr(addr=26180, dgram=ff350c3c)

1226516950.428397: amandad: (sockaddr_in *)26180 = { 2, 1018, 10.1.1.104
}

1226516950.428410: amandad: dgram_send_addr: ff350c3c->socket = 0

 

This is only occurring on 2 of my clients I have removed all other
clients and currently just testing one client with the behavior.

Increased the etimeout value but that didn't seem to help.

 

Anderson 

 

 

 

 

Reply via email to