Re: amfetchdump returns 'Error writing to fd 7: Broken pipe'

2017-02-20 Thread Jean-Francois Malouin
* Jean-Louis Martineau  [20170220 14:24]:
> On 20/02/17 11:47 AM, Jean-Francois Malouin wrote:
> > Hi,
> >
> > Attempting to restore a DLE with amfetchdump with:
> >
> > su amanda -c "~amanda/sbin/amfetchdump -p -d tape:/dev/nst0 top localhost 
> > /data_/mica1 20170219 " | tar -xpGf -
> >
> > returns 'Error writing to fd 7: Broken pipe'. The logfile shows:
> Assuming fd 7 is the pipe to tar, it means that tar exited before 
> amfetchdump send all the data, do you get any error from tar?
> You can try to do it in 2 step:
> 
> $ ~amanda/sbin/amfetchdump -p -d tape:/dev/nst0 top localhost /data_/mica1 
> 20170219 > datafile
> 
> Do the data file contain the backup? ls -l, file datafile
> 
> $ tar -xpGf datafile
> 
> Check system/selinux logs for error.

Nothing relevant shows up in the host's syslog.
Doing the restore in 2 steps as proposed by Jean-Louis terminates
without an error:

~# pwd
/holddisk/restore
~# su amanda -c "~amanda/sbin/amfetchdump -p -d /dev/nst0 top localhost 
/data_/mica1 20170219" > mica1.datafile
dow:/holddisk/restore# echo $?
0
~# tar -C /holddisk/restore/mica1 -xpGf mica1.datafile
~# echo $?
0

The amfetchdump debug file shows all is good.

I know I had 2 issues in the same post: what about the 2nd one,
amrestore not doing recursive restore?

thanks,
jf

> 
> Jean-Louis
> 
> >
> > Mon Feb 20 11:26:03.128402078 2017: pid 14583: thd-0x2e49c00: amfetchdump: 
> > xfer_cancel_with_error: Error writing to fd 7: Broken pipe
> > Mon Feb 20 11:26:03.128432851 2017: pid 14583: thd-0x2e49c00: amfetchdump: 
> > xfer_queue_message: MSG:  > elt= version=0>
> > Mon Feb 20 11:26:03.128457912 2017: pid 14583: thd-0x2e49c00: amfetchdump: 
> > xfer_queue_message: MSG:  > elt= version=0>
> > Mon Feb 20 11:26:03.128835292 2017: pid 14583: thd-0x1502000: amfetchdump: 
> > Cancelling  -> 
> > )>
> > Mon Feb 20 11:26:03.128867340 2017: pid 14583: thd-0x2e49c00: amfetchdump: 
> > sending XMSG_CRC message 0x317c810
> > Mon Feb 20 11:26:03.128878700 2017: pid 14583: thd-0x2e49c00: amfetchdump: 
> > pull_and_write CRC: f9201a62  size 102900957184
> > Mon Feb 20 11:26:03.128886874 2017: pid 14583: thd-0x2e49c00: amfetchdump: 
> > xfer_queue_message: MSG:  > elt= version=0>
> > Mon Feb 20 11:26:03.128900594 2017: pid 14583: thd-0x2e49c00: amfetchdump: 
> > xfer_queue_message: MSG:  > elt= version=0>
> > Mon Feb 20 11:26:03.129525236 2017: pid 14583: thd-0x1502000: amfetchdump: 
> > dest_crc: f9201a62:102900957184
> > Mon Feb 20 11:26:03.129946543 2017: pid 14583: thd-0x1502000: amfetchdump: 
> > /opt/amanda/perl/Amanda/Restore.pm:1892:info:4900012 100491264 kb
> > Mon Feb 20 11:26:03.130359016 2017: pid 14583: thd-0x1502000: amfetchdump: 
> > /opt/amanda/perl/Amanda/Restore.pm:1920:error:4900055 Error writing to fd 
> > 7: Broken pipe
> > Mon Feb 20 11:26:03.130501339 2017: pid 14583: thd-0x1502000: amfetchdump: 
> > /opt/amanda/perl/Amanda/Restore.pm:2137:error:4900068 Error writing to fd 
> > 7: Broken pipe
> >
> > What is the error all about?
> >
> > A previous amrecover run seems to have gone to all the motions
> > correctly, but in the end the restore done in temp dir yield
> > nothing...an empty directory...
> >
> > Any hint?
> > Thanks,
> > jf


Re: amfetchdump returns 'Error writing to fd 7: Broken pipe'

2017-02-20 Thread Jean-Louis Martineau
On 20/02/17 11:47 AM, Jean-Francois Malouin wrote:
> Hi,
>
> Attempting to restore a DLE with amfetchdump with:
>
> su amanda -c "~amanda/sbin/amfetchdump -p -d tape:/dev/nst0 top localhost 
> /data_/mica1 20170219 " | tar -xpGf -
>
> returns 'Error writing to fd 7: Broken pipe'. The logfile shows:
Assuming fd 7 is the pipe to tar, it means that tar exited before 
amfetchdump send all the data, do you get any error from tar?
You can try to do it in 2 step:

$ ~amanda/sbin/amfetchdump -p -d tape:/dev/nst0 top localhost /data_/mica1 
20170219 > datafile

Do the data file contain the backup? ls -l, file datafile

$ tar -xpGf datafile

Check system/selinux logs for error.

Jean-Louis

>
> Mon Feb 20 11:26:03.128402078 2017: pid 14583: thd-0x2e49c00: amfetchdump: 
> xfer_cancel_with_error: Error writing to fd 7: Broken pipe
> Mon Feb 20 11:26:03.128432851 2017: pid 14583: thd-0x2e49c00: amfetchdump: 
> xfer_queue_message: MSG:  elt= version=0>
> Mon Feb 20 11:26:03.128457912 2017: pid 14583: thd-0x2e49c00: amfetchdump: 
> xfer_queue_message: MSG:  elt= version=0>
> Mon Feb 20 11:26:03.128835292 2017: pid 14583: thd-0x1502000: amfetchdump: 
> Cancelling  -> 
> )>
> Mon Feb 20 11:26:03.128867340 2017: pid 14583: thd-0x2e49c00: amfetchdump: 
> sending XMSG_CRC message 0x317c810
> Mon Feb 20 11:26:03.128878700 2017: pid 14583: thd-0x2e49c00: amfetchdump: 
> pull_and_write CRC: f9201a62  size 102900957184
> Mon Feb 20 11:26:03.128886874 2017: pid 14583: thd-0x2e49c00: amfetchdump: 
> xfer_queue_message: MSG:  elt= version=0>
> Mon Feb 20 11:26:03.128900594 2017: pid 14583: thd-0x2e49c00: amfetchdump: 
> xfer_queue_message: MSG:  elt= version=0>
> Mon Feb 20 11:26:03.129525236 2017: pid 14583: thd-0x1502000: amfetchdump: 
> dest_crc: f9201a62:102900957184
> Mon Feb 20 11:26:03.129946543 2017: pid 14583: thd-0x1502000: amfetchdump: 
> /opt/amanda/perl/Amanda/Restore.pm:1892:info:4900012 100491264 kb
> Mon Feb 20 11:26:03.130359016 2017: pid 14583: thd-0x1502000: amfetchdump: 
> /opt/amanda/perl/Amanda/Restore.pm:1920:error:4900055 Error writing to fd 7: 
> Broken pipe
> Mon Feb 20 11:26:03.130501339 2017: pid 14583: thd-0x1502000: amfetchdump: 
> /opt/amanda/perl/Amanda/Restore.pm:2137:error:4900068 Error writing to fd 7: 
> Broken pipe
>
> What is the error all about?
>
> A previous amrecover run seems to have gone to all the motions
> correctly, but in the end the restore done in temp dir yield
> nothing...an empty directory...
>
> Any hint?
> Thanks,
> jf


Re: amfetchdump returns 'Error writing to fd 7: Broken pipe'

2017-02-20 Thread Jean-Francois Malouin
* Jean-Francois Malouin  [20170220 
11:47]:
> Hi,
> 
> Attempting to restore a DLE with amfetchdump with:

Forgot to add: this with amanda-3.4.2

> 
> su amanda -c "~amanda/sbin/amfetchdump -p -d tape:/dev/nst0 top localhost 
> /data_/mica1 20170219 " | tar -xpGf -
> 
> returns 'Error writing to fd 7: Broken pipe'. The logfile shows:
> 
> Mon Feb 20 11:26:03.128402078 2017: pid 14583: thd-0x2e49c00: amfetchdump: 
> xfer_cancel_with_error: Error writing to fd 7: Broken pipe
> Mon Feb 20 11:26:03.128432851 2017: pid 14583: thd-0x2e49c00: amfetchdump: 
> xfer_queue_message: MSG:  elt= version=0>
> Mon Feb 20 11:26:03.128457912 2017: pid 14583: thd-0x2e49c00: amfetchdump: 
> xfer_queue_message: MSG:  elt= version=0>
> Mon Feb 20 11:26:03.128835292 2017: pid 14583: thd-0x1502000: amfetchdump: 
> Cancelling  -> 
> )>
> Mon Feb 20 11:26:03.128867340 2017: pid 14583: thd-0x2e49c00: amfetchdump: 
> sending XMSG_CRC message 0x317c810
> Mon Feb 20 11:26:03.128878700 2017: pid 14583: thd-0x2e49c00: amfetchdump: 
> pull_and_write CRC: f9201a62  size 102900957184
> Mon Feb 20 11:26:03.128886874 2017: pid 14583: thd-0x2e49c00: amfetchdump: 
> xfer_queue_message: MSG:  elt= version=0>
> Mon Feb 20 11:26:03.128900594 2017: pid 14583: thd-0x2e49c00: amfetchdump: 
> xfer_queue_message: MSG:  elt= version=0>
> Mon Feb 20 11:26:03.129525236 2017: pid 14583: thd-0x1502000: amfetchdump: 
> dest_crc: f9201a62:102900957184
> Mon Feb 20 11:26:03.129946543 2017: pid 14583: thd-0x1502000: amfetchdump: 
> /opt/amanda/perl/Amanda/Restore.pm:1892:info:4900012 100491264 kb
> Mon Feb 20 11:26:03.130359016 2017: pid 14583: thd-0x1502000: amfetchdump: 
> /opt/amanda/perl/Amanda/Restore.pm:1920:error:4900055 Error writing to fd 7: 
> Broken pipe
> Mon Feb 20 11:26:03.130501339 2017: pid 14583: thd-0x1502000: amfetchdump: 
> /opt/amanda/perl/Amanda/Restore.pm:2137:error:4900068 Error writing to fd 7: 
> Broken pipe
> 
> What is the error all about?
> 
> A previous amrecover run seems to have gone to all the motions
> correctly, but in the end the restore done in temp dir yield
> nothing...an empty directory...

Addendum to the amrecover:

Doing 'add .' at the prompt in amrecover apparently will not descend
recursively in the DLE: it just seems to add '.', and that's it.  
'add *' will just add the matching files and directories, but again will
not do a recursive restore.

Makes amrecover a bit useless unless to restore single files.
That's not the behaviour I had with previous version.
Am I doing something wrong or overlooking something simple?

jf


> 
> Any hint?
> Thanks,
> jf

-- 
Jean-Francois Malouin | IT Operations and Infrastructure
McConnell Brain Imaging Centre | MNI | McGill University


amfetchdump returns 'Error writing to fd 7: Broken pipe'

2017-02-20 Thread Jean-Francois Malouin
Hi,

Attempting to restore a DLE with amfetchdump with:

su amanda -c "~amanda/sbin/amfetchdump -p -d tape:/dev/nst0 top localhost 
/data_/mica1 20170219 " | tar -xpGf -

returns 'Error writing to fd 7: Broken pipe'. The logfile shows:

Mon Feb 20 11:26:03.128402078 2017: pid 14583: thd-0x2e49c00: amfetchdump: 
xfer_cancel_with_error: Error writing to fd 7: Broken pipe
Mon Feb 20 11:26:03.128432851 2017: pid 14583: thd-0x2e49c00: amfetchdump: 
xfer_queue_message: MSG:  version=0>
Mon Feb 20 11:26:03.128457912 2017: pid 14583: thd-0x2e49c00: amfetchdump: 
xfer_queue_message: MSG:  version=0>
Mon Feb 20 11:26:03.128835292 2017: pid 14583: thd-0x1502000: amfetchdump: 
Cancelling  -> 
)>
Mon Feb 20 11:26:03.128867340 2017: pid 14583: thd-0x2e49c00: amfetchdump: 
sending XMSG_CRC message 0x317c810
Mon Feb 20 11:26:03.128878700 2017: pid 14583: thd-0x2e49c00: amfetchdump: 
pull_and_write CRC: f9201a62  size 102900957184
Mon Feb 20 11:26:03.128886874 2017: pid 14583: thd-0x2e49c00: amfetchdump: 
xfer_queue_message: MSG:  version=0>
Mon Feb 20 11:26:03.128900594 2017: pid 14583: thd-0x2e49c00: amfetchdump: 
xfer_queue_message: MSG:  version=0>
Mon Feb 20 11:26:03.129525236 2017: pid 14583: thd-0x1502000: amfetchdump: 
dest_crc: f9201a62:102900957184
Mon Feb 20 11:26:03.129946543 2017: pid 14583: thd-0x1502000: amfetchdump: 
/opt/amanda/perl/Amanda/Restore.pm:1892:info:4900012 100491264 kb
Mon Feb 20 11:26:03.130359016 2017: pid 14583: thd-0x1502000: amfetchdump: 
/opt/amanda/perl/Amanda/Restore.pm:1920:error:4900055 Error writing to fd 7: 
Broken pipe
Mon Feb 20 11:26:03.130501339 2017: pid 14583: thd-0x1502000: amfetchdump: 
/opt/amanda/perl/Amanda/Restore.pm:2137:error:4900068 Error writing to fd 7: 
Broken pipe

What is the error all about?

A previous amrecover run seems to have gone to all the motions
correctly, but in the end the restore done in temp dir yield
nothing...an empty directory...

Any hint?
Thanks,
jf


Re: amcheckdump fails with broken pipe

2011-01-23 Thread omer
Hi,

Le vendredi 21 janvier 2011 17:55:50, vous avez écrit :
> I am finding that amcheckdump (3.2.0) gets sporadic failures with messages
> like
> 
> Reading volume AMB009L4 file 9
> Validation errors:
> Error writing to fd 9: Broken pipe
> /bin/gzip exited with status 1

maybe you should check the logs to have the gzip error message ?

> 
> A similar problem was reported in
> http://www.mail-archive.com/amanda-users@amanda.org/msg44288.html
> but I'm not sure whether it is meant to be fixed in 3.2.0.
> 
> In my configuration tar is not involved, the backup files are in
> compressed dump format. So amcheckdump creates a pipe containing gzip and
> restore; presumably gzip terminates when it gets to the end of the real
> data and does not bother reading the zero padding at the end of the last
> block. Unlike with tar I don't know any way of forcing gzip to
> read till the bitter end.
> 
> My blocks are 512kB and they are read over a slowish iscsi connection so
> that possibly gives gzip plenty of time to process its data while the
> rest of the block is still being read.
> 
> I can live with the problem, but it is a bit irritating so any comments
> welcome.
> 
> Bob

not sure it is related but I had some similar problem when trying to write my 
own verification script. I did something like that :

amfetchdump -p -h ... > FETCH_PIPE &
( dd bs=32k of=HEADER_PIPE count=1; dd bs=32k of=DUMP_PIPE; ) 

amcheckdump fails with broken pipe

2011-01-21 Thread Bob Vickers
I am finding that amcheckdump (3.2.0) gets sporadic failures with messages 
like


Reading volume AMB009L4 file 9
Validation errors:
Error writing to fd 9: Broken pipe
/bin/gzip exited with status 1

A similar problem was reported in
http://www.mail-archive.com/amanda-users@amanda.org/msg44288.html
but I'm not sure whether it is meant to be fixed in 3.2.0.

In my configuration tar is not involved, the backup files are in 
compressed dump format. So amcheckdump creates a pipe containing gzip and 
restore; presumably gzip terminates when it gets to the end of the real 
data and does not bother reading the zero padding at the end of the last 
block. Unlike with tar I don't know any way of forcing gzip to 
read till the bitter end.


My blocks are 512kB and they are read over a slowish iscsi connection so 
that possibly gives gzip plenty of time to process its data while the 
rest of the block is still being read.


I can live with the problem, but it is a bit irritating so any comments 
welcome.


Bob


Fw: Amanda 2.5.2p1-1 - Broken Pipe .....again

2009-08-18 Thread Yogesh Hasabnis
Additional information about this problem.



An additional external USB hard drive has been added to this host about
one month ago. The filesystem on this hard drive is not backed up with
Amanda. But the observation is that the "Broken Pipe" problem seems to
disappear when the filesystem on this hard disk is unmounted from the
root filesystem.



Any suggestions for resolving the problem are appreciated.



Thanks in advance,

Yogesh






  

amdump.1
Description: Binary data


Amanda 2.5.2p1-1 - Broken Pipe .....again

2009-08-11 Thread Yogesh Hasabnis
Hi All,



We use Amanda-2.5.2p1 for our daily incremental backup requirements. A
single filesystem on a single host is backed up in this setup. The
backup server and the client is the same host. The OS on the host is
RHEL 3. The setup has been working reasonably fine for last two and
half years, except for a few problems once in a while. Currently I am
facing a problem in which the backup has failed for last one week and
the error message that I receive in the email message is as follows:



FAILURE AND STRANGE DUMP SUMMARY:
  /vol/vol1/home  lev 0  FAILED [data write: Broken pipe]
 /vol/vol1/home  lev 0  FAILED [input: Can't read data: : 
Connection reset by peer]
  /vol/vol1/home  lev 0  FAILED [dump to tape failed]


The same error was observed about a month ago. I had browsed through the wiki, 
in which I couldn't find a solution. So I just restarted my backup 
client/server which had solved the problem. I am trying to figure out whether 
the issue can be resolved without restarting the host. I have attached the 
amdump.1 file with this email. Would appreciate if I could get some guidance to 
resolve this issue and make sure that it doesn't occur again.

Thanks,
Yogesh

 










  

amdump.1
Description: Binary data


Re: FAILED [data write: Broken pipe]

2009-06-18 Thread Yogesh Hasabnis


--- On Thu, 6/18/09, Jean-Louis Martineau  wrote:


From: Jean-Louis Martineau 
Subject: Re: FAILED [data write: Broken pipe]
To: "Yogesh Hasabnis" 
Cc: amanda-users@amanda.org
Date: Thursday, June 18, 2009, 6:02 PM


Yogesh Hasabnis wrote:
> gzip: stdout: Connection timed out
This is weird, do your system automaticaly close connection after some time?
After how many it happens? is it always the same?

Jean-Louis

No it doesn't. The backup system with amanda was working fine all these days. 
The problem has cropped up since last 4 days. All of the last 4 daily backups 
have failed. I am attaching the error message received in the email again. It's 
as follows:
 
FAILURE AND STRANGE DUMP SUMMARY:
    /vol/vol1/home  lev 0  FAILED [data write: Broken pipe]
    /vol/vol1/home  lev 0  FAILED [input: Can't read data: : 
Connection reset by peer]
    /vol/vol1/home  lev 0  FAILED [dump to tape failed]


STATISTICS:
  Total   Full  Incr.
          
Estimate Time (hrs:min)    1:26
Run Time (hrs:min) 5:27
Dump Time (hrs:min)    0:00   0:00   0:00
Output Size (meg)   0.0    0.0    0.0
Original Size (meg) 0.0    0.0    0.0
Avg Compressed Size (%) -- -- -- 
Filesystems Dumped    0  0  0
Avg Dump Rate (k/s) -- -- -- 

Tape Time (hrs:min)    0:00   0:00   0:00
Tape Size (meg) 0.0    0.0    0.0
Tape Used (%)   0.0    0.0    0.0
Filesystems Taped 0  0  0

Chunks Taped  0  0  0
Avg Tp Write Rate (k/s) -- -- -- 

USAGE BY TAPE:
  Label Time  Size  %    Nb    Nc
  tendaily-08   0:00    0K    0.0 0 0

 
FAILED AND STRANGE DUMP DETAILS:

/--   /vol/vol1/home lev 0 FAILED [data write: Broken pipe]
sendbackup: start [toe.tti.tensilica.com:/vol/vol1/home level 0]
sendbackup: info BACKUP=/bin/gtar
sendbackup: info RECOVER_CMD=/bin/gtar -f... -
sendbackup: info end
\

 
NOTES:
  planner: Incremental of toe.tti.tensilica.com:/vol/vol1/home bumped to level 
2.
  taper: tape tendaily-08 kb 32064 fm 1 [OK]

 
DUMP SUMMARY:
   DUMPER STATS   TAPER STATS 
HOSTNAME DISK    L ORIG-KB  OUT-KB  COMP%  MMM:SS   KB/s MMM:SS   KB/s
-- - -
 -/vol1/home 0 FAILED 

(brought to you by Amanda version 2.5.0p2)
 
Thanks in advance,
Yogesh



  

Re: FAILED [data write: Broken pipe]

2009-06-18 Thread Jean-Louis Martineau

Post the complete amdump.? file

Yogesh Hasabnis wrote:



--- On *Thu, 6/18/09, Jean-Louis Martineau //* 
wrote:


No it doesn't. The backup system with amanda was working fine all
these days. The problem has cropped up since last 4 days. All of
the last 4 daily backups have failed. I am attaching the error
message received in the email again. It's as follows:
 





Re: FAILED [data write: Broken pipe]

2009-06-18 Thread Jean-Louis Martineau

Yogesh Hasabnis wrote:

gzip: stdout: Connection timed out

This is weird, do your system automaticaly close connection after some time?
After how many it happens? is it always the same?

Jean-Louis


Re: FAILED [data write: Broken pipe]

2009-06-18 Thread Yogesh Hasabnis
Any suggestions ? My backup has failed for 3 days now.




  

FAILED [data write: Broken pipe]

2009-06-17 Thread Yogesh Hasabnis

Hi All,



We use Amanda 2.5.0p2 in our backup setup. We have the same host acting
as the amanda backup server as well as the client. The OS on the backup 
server/client host is RHEL3. The setup has been
working reasonably OK for the last 2 years. 

I am facing problems in backups for the last 2 days. The error message that I 
get in the email message is as follows:

FAILURE AND STRANGE DUMP SUMMARY:
/vol/vol1/home  lev 2  FAILED [data write: Broken pipe]
/vol/vol1/home  lev 2  FAILED [data write: Broken pipe

I see the following text in the /var/log/amanda//amdump file

gzip: stdout: Connection timed out
dumper: kill compress command
driver: state time 29736.268 free kps: 1759 space: 284000 taper: idle 
idle-dumpers: 3 qlen tapeq: 0 runq: 0 roomq: 0 wakeup: 0 driver-idle: no-dumpers
dumper: kill index command
driver: interface-state time 29736.268 if : free 359 if LE0: free 400 if LOCAL:
free 1000
driver: hdisk-state time 29736.268 hdisk 0: free 284000 dumpers 1
driver: result time 29736.268 from dumper0: FAILED 00-2 [data write: Broken
pipe]
driver: send-cmd time 29736.268 to chunker0: FAILED 00-2
driver: state time 35868.133 free kps: 1759 space: 284000 taper: idle 
idle-dumpers: 3 qlen tapeq: 0 runq: 0 roomq: 0 wakeup: 0 driver-idle: no-dumpers
driver: interface-state time 35868.133 if : free 359 if LE0: free 400 if LOCAL:
free 1000
driver: hdisk-state time 35868.133 hdisk 0: free 284000 dumpers 1
driver: result time 35868.133 from chunker0: PARTIAL 00-2 60480 [sec 
14769.366 kb 60480 kps 4.1]
driver: finished-cmd time 35868.133 chunker0 chunked :/vol/vol1/home


Any help or suggestions in debugging this problem are appreciated.

Thanks in advance.
Yogesh







  



sendbackup: broken pipe

2008-11-24 Thread Jukka Salmi
Hello,

every now and then I see this error on an otherwise perfectly working
Amanda system (server 2.4.4p3, client 2.5.1p1):

FAILURE AND STRANGE DUMP SUMMARY:
  host.staso / lev 0 FAILED [data timeout]


The client's sendbackup log for such a failed run:

sendbackup: debug 1 pid 14726 ruid 34 euid 34: start at Fri Nov 21 22:46:00 2008
sendbackup: version 2.5.1p1
Could not open conf file "/etc/amanda/amanda-client.conf": No such file or 
directory
  sendbackup req: 
  parsed request as: program `GNUTAR'
 disk `/'
 device `/'
 level 0
 since 1970:1:1:0:0:0
 options 
`|;bsd-auth;no-record;index;exclude-list=.backup.exc;exclude-optional;'
sendbackup: start: host.stasoft.ch:/ lev 0
sendbackup-gnutar: time 0.006: doing level 0 dump as listed-incremental to 
'/var/lib/amanda/gnutar-lists/host.stasoft.ch__0.new'
sendbackup-gnutar: time 0.010: doing level 0 dump from date: 1970-01-01  
0:00:00 GMT
sendbackup: time 0.015: spawning /usr/lib/amanda/runtar in pipeline
sendbackup: argument list: runtar NOCONFIG gtar --create --file - --directory / 
--one-file-system --listed-incremental 
/var/lib/amanda/gnutar-lists/host.stasoft.ch__0.new --sparse 
--ignore-failed-read --totals --exclude-from 
/tmp/amanda/sendbackup._.20081121224600.exclude .
sendbackup: time 0.078: started index creator: "/bin/tar -tf - 2>/dev/null | 
sed -e 's/^\.//'"
sendbackup-gnutar: time 0.082: /usr/lib/amanda/runtar: pid 14729
sendbackup: time 0.083: started backup
sendbackup: time 2996.696: index tee cannot write [Broken pipe]
sendbackup: time 2996.718: pid 14728 finish time Fri Nov 21 23:35:57 2008
sendbackup: time 2996.697: 118: strange(?): sendbackup: index tee cannot write 
[Broken pipe]
sendbackup: time 2996.751: 118: strange(?): sed: couldn't flush stdout: Broken 
pipe
sendbackup: time 2996.779:  47:size(|): Total bytes written: 16497745920 
(16GiB, ?/s)
sendbackup: time 2996.780: 118: strange(?): gtar: -: Wrote only 4096 of 10240 
bytes
sendbackup: time 2996.781: 118: strange(?): gtar: Error is not recoverable: 
exiting now
sendbackup: time 2996.781: parsed backup messages
sendbackup: time 2996.781: pid 14726 finish time Fri Nov 21 23:35:57 2008


On the server side, the amdump log contains:

[...]
driver: send-cmd time 2015.907 to dumper0: FILE-DUMP 01-00032 
/amholdingdisk/20081121/host.stasoft.ch._.0 host.stasoft.ch feff9ffeff / 
NODEVICE 0 1970:1:1:0:0:0 1048576 GNUTAR 49498848 
|;bsd-auth;no-record;index;exclude-list=.backup.exc;exclude-optional;
[...]
driver: result time 6812.702 from dumper0: FAILED 01-00032 [data timeout]
[...]


Hmm, but most of the time I do _not_ see this problem.  Any hints what
could cause the problem or how to debug this further?  


Regards, Jukka

-- 
bashian roulette:
$ ((RANDOM%6)) || rm -rf ~


data-write: Broken pipe

2007-08-17 Thread Yogesh Hasabnis

Hi,

We use amanda-2.5.0 for our backup requirements. 

The backup failed yesterday and the error message received in the email

is as given below. Would like to get some advice about why this error may have 
occurred.

FAILURE AND STRANGE DUMP SUMMARY:toe.tti.tensilica.com  /vol/vol1/home  lev 
3  FAILED [data write: Broken pipe]driver: FATAL Don't know how to send 
ABORT command to chunkerchunker: FATAL error [bad command after 
RQ-MORE-DISK: "QUIT"]

 

Also, after running the "ps -ef" command I observe that the "amdump" process is 
still running. The ouput of 

"ps -ef | grep amanda" is as follows:

30318 20267 1 0 Aug16 ? 00:00:00 /usr/lib/amanda/sendbackup
root 20273 20267 93 Aug16 ? 16:00:37 gtar --create --file - --directory 
/vol/vol1/home --one-file-system --listed-incremental 
/var/lib/amanda/gnutar-lists/toe.tti.tensilica.com_vol_vol1_home_3.new --sparse 
--ignore-failed-read --totals .
 
Is it OK if I kill this process using "kill -15 " or should I
use something like "amcleanup" before I execute today's backup?
 
Thanks in advance.
 
Yogesh


 

   
-
Sick sense of humor? Visit Yahoo! TV's Comedy with an Edge to see what's on, 
when. 

Re: index tee cannot write [Broken pipe]

2007-08-02 Thread Mike Gallant

Hi,

Thanks, I checked the network setting and I don't find ipv6 active on  
any interface but I will keep checking a bit deeper in case it was on  
somewhere else.


The link and error Paul pointed out may also have some merit.  I will  
use ipmon and snoop tonight when the backup kicks off to see what is  
going on the network.  I didn't see any references to the client on  
the server (in the /tmp/amanda/. debug files which strikes me  
funny).


Thanks for the the help and pointers,

Mike
On Aug 2, 2007, at 2:47 PM, Dustin J. Mitchell wrote:


On Thu, Aug 02, 2007 at 09:56:12AM -0400, Mike Gallant wrote:
I have an amanda (2.5.1p3) server (solaris10) with seven amanda  
clients (2.4.4) and a new amanda (2.5.2) client on new hardware.   
I have been getting a "index tee cannot write
[Broken pipe]" error message.  I have looked at all the various  
aspects I can think of but still no joy.  Could someone point me  
in the right direction for resolving this.


I have included the sendbackup.debug (client) and a good part of  
the amandad (client) missing the repetitive entries.
amandad: time 1800.842: stream_server: bind(in6addr_any) failed:  
Invalid argument
amandad: time 1800.842: security_seterror(handle=30d90,  
driver=ff2dd95c (BSD) error=can't create server stream: Invalid  
argument)


This looks like an IPv6 networking problem.  Several such problems  
were

fixed in the 2.5.1 -> 2.5.2 upgrade.  Perhaps you could try upgrading?

Dustin

--
Dustin J. Mitchell
Storage Software Engineer, Zmanda, Inc.
http://www.zmanda.com/




Re: index tee cannot write [Broken pipe]

2007-08-02 Thread Dustin J. Mitchell
On Thu, Aug 02, 2007 at 09:56:12AM -0400, Mike Gallant wrote:
> I have an amanda (2.5.1p3) server (solaris10) with seven amanda clients 
> (2.4.4) and a new amanda (2.5.2) client on new hardware.  I have been getting 
> a "index tee cannot write 
> [Broken pipe]" error message.  I have looked at all the various aspects I can 
> think of but still no joy.  Could someone point me in the right direction for 
> resolving this.
> 
> I have included the sendbackup.debug (client) and a good part of the amandad 
> (client) missing the repetitive entries.
> amandad: time 1800.842: stream_server: bind(in6addr_any) failed: Invalid 
> argument
> amandad: time 1800.842: security_seterror(handle=30d90, driver=ff2dd95c (BSD) 
> error=can't create server stream: Invalid argument)

This looks like an IPv6 networking problem.  Several such problems were
fixed in the 2.5.1 -> 2.5.2 upgrade.  Perhaps you could try upgrading?

Dustin

-- 
Dustin J. Mitchell
Storage Software Engineer, Zmanda, Inc.
http://www.zmanda.com/


Re: index tee cannot write [Broken pipe]

2007-08-02 Thread Paul Bijnens

On 2007-08-02 15:56, Mike Gallant wrote:
I have an amanda (2.5.1p3) server (solaris10) with seven amanda clients 
(2.4.4) and a new amanda (2.5.2) client on new hardware.  I have been 
getting a "index tee cannot write [Broken pipe]" error message.  I have 
looked at all the various aspects I can think of but still no joy.  
Could someone point me in the right direction for resolving this.



The error "index tee cannot write" is the collateral damage of some
earlier problem.  The index tcp-connection got closed, probably because
of some other error.
See also:
  http://wiki.zmanda.com/index.php/Mesg_read:_Connection_reset_by_peer
which has a similar effect on the client.

Don't focus on this "index tee" but look earlier.
I'm interested in the error messages from the server-side as well.




I have included the sendbackup.debug (client) and a good part of the 
amandad (client) missing the repetitive entries.


Thanks,
Mike

sendbackup: debug 1 pid 14529 ruid 105 euid 105: start at Thu Aug  2 01:55:27 
2007
sendbackup: version 2.5.2
[...]
sendbackup: time 3.199:  91:  normal(|):   DUMP: Estimated 5852872 blocks 
(2857.85MB) on 0.04 tapes.


So you're trying to dump less than 3 Gbyte.



sendbackup: time 3.245:  91:  normal(|):   DUMP: Dumping (Pass III) 
[directories]
sendbackup: time 16237.077: index tee cannot write [Broken pipe]


Here the index tcp-connection got broken, proabably because the server
gave up on this host.  The real problem happened much earlier.
(more than 4 hours after startup -- a really patient server :-) )



sendbackup: time 16237.077: pid 14531 finish time Thu Aug  2 06:26:04 2007





vReading conf file "/etc/opt/slis/amanda/amanda-client.conf".

[...]


amandad: time 0.005: security_getdriver(name=BSD) returns ff2dd95c
amandad: version 2.5.2
amandad: time 0.005: build: VERSION="Amanda-2.5.2"
amandad: time 0.005:BUILT_DATE="Thu May 24 22:05:16 EDT 2007"


Did it work correct before?  Of do you struggle with the problem
until now?



[...]
amandad: time 0.024: accept recv REQ pkt:
<<<<<
SERVICE sendbackup
DUMP /zones  0 1970:1:1:0:0:0 OPTIONS |;auth=BSD;index;
 >>>>>
amandad: time 0.024: creating new service: sendbackup
DUMP /zones  0 1970:1:1:0:0:0 OPTIONS |;auth=BSD;index;

amandad: time 0.028: sending ACK pkt:
<<<<<
 >>>>>
amandad: time 0.028: dgram_send_addr(addr=30db0, dgram=ff2dddac)
amandad: time 0.028: (sockaddr_in *)30db0 = { 2, 731, 192.168.36.38 }
amandad: time 0.028: dgram_send_addr: ff2dddac->socket = 0
amandad: time 0.072: security_streaminit(stream=418f8, driver=ff2dd95c 
(BSD))

amandad: time 0.072: try_socksize: send buffer size is 65536
amandad: time 0.072: try_socksize: receive buffer size is 65536
amandad: time 0.072: stream_server: Could not bind to any port: Invalid argument



Here is the problem, I believe, although I do not know what is wrong.
Some syscall is giving "EINVAL" on one of the arguments.
Do you have some weird network setup?
e.g. do you have a lower limit then 64k for UDP-packets etc.



amandad: time 0.072: stream_server: Retrying entire range after 10 second delay.


... and so on ...

amandad: time 1800.842: security_seterror(handle=30d90, driver=ff2dd95c 
(BSD) error=can't create server stream: Invalid argument)
amandad: time 1800.842: couldn't open stream to server: can't create 
server stream: Invalid argument
amandad: time 1800.842: security_streaminit(stream=418f8, 
driver=ff2dd95c (BSD))


The upper layers repeating what the lower layers were saying...



amandad: time 1800.842: try_socksize: send buffer size is 65536
amandad: time 1800.842: try_socksize: receive buffer size is 65536
amandad: time 1800.842: stream_server: Could not bind to any port: Invalid 
argument
amandad: time 1800.842: stream_server: Retrying entire range after 10 second 
delay.


and starting again (not sure why??)



amandad: time 5402.381: stream_server: bind(in6addr_any) failed: Invalid 
argument


in6addr_any??  suddenly using IPV6??

I believe 2.5.2p1 fixed some IPV6 bugs.  (It has at least less bugs, so
better try that one.  I'm using it without problems.)






amandad: time 16247.142: sending REP pkt:
<<<<<
CONNECT DATA -1 MESG -1 INDEX -1
OPTIONS features=9ffe00;
 >>>>>


More trouble, proabably because of the previous ones.
Normally amandad should reply with the tcp-ports it has opened
for the three streams, and "-1" is of course not valid.



amandad: time 16247.142: dgram_send_addr(addr=49e40, dgram=ff2dddac)
amandad: time 16247.142: (sockaddr_in *)49e40 = { 2, 731, 192.168.36.38 }
amandad: time 16247.142: dgram_send_addr: ff2dddac->socket = 0
amandad: time 16257.150: timeout
amandad: time 16257.150: timeout waiting for ACK for our REP


But after 16000 seconds the server isn't expecting anything from this
host prob

index tee cannot write [Broken pipe]

2007-08-02 Thread Mike Gallant
I have an amanda (2.5.1p3) server (solaris10) with seven amanda  
clients (2.4.4) and a new amanda (2.5.2) client on new hardware.  I  
have been getting a "index tee cannot write [Broken pipe]" error  
message.  I have looked at all the various aspects I can think of but  
still no joy.  Could someone point me in the right direction for  
resolving this.


I have included the sendbackup.debug (client) and a good part of the  
amandad (client) missing the repetitive entries.


Thanks,
Mike

sendbackup: debug 1 pid 14529 ruid 105 euid 105: start at Thu Aug  2  
01:55:27 20

07
sendbackup: version 2.5.2
Reading conf file "/etc/opt/slis/amanda/amanda-client.conf".
Reading conf file "/etc/opt/slis/amanda/mwf/amanda-client.conf".
sendbackup: debug 1 pid 14529 ruid 105 euid 105: rename at Thu Aug  2  
01:55:27 2

007
  sendbackup req: |;auth=BSD;index;>

  parsed request as: program `DUMP'
 disk `/zones'
 device `/zones'
 level 0
 since 1970:1:1:0:0:0
 options `|;auth=BSD;index;'
sendbackup: start: bowen:/zones lev 0
sendbackup: time 0.015: dumping device '/dev/md/rdsk/d40' with 'ufs'
sendbackup: time 0.017: spawning /usr/sbin/ufsdump in pipeline
sendbackup: time 0.017: argument list: dump 0usf 1048576 - /dev/md/ 
rdsk/d40

sendbackup: time 0.019: started backup
sendbackup: time 0.052: started index creator: "/usr/sbin/ufsrestore - 
tvf - 2>&1

| sed -e '
s/^leaf[]*[0-9]*[   ]*\.//
t
/^dir[  ]/ {
s/^dir[ ]*[0-9]*[   ]*\.//
s%$%/%
t
}
d
'"
sendbackup: time 0.266:  91:  normal(|):   DUMP: Date of this level 0  
dump: Thu

Aug 02 01:55:27 2007
sendbackup: time 0.267:  91:  normal(|):   DUMP: Date of last level 0  
dump: the

epoch
sendbackup: time 0.267:  91:  normal(|):   DUMP: Dumping /dev/md/rdsk/ 
d40

(bowen:/zones) to standard output.
sendbackup: time 2.105:  91:  normal(|):   DUMP: Mapping (Pass I)  
[regular files

]
sendbackup: time 2.576:  91:  normal(|):   DUMP: Mapping (Pass II)  
[directories]
sendbackup: time 3.198:  91:  normal(|):   DUMP: Writing 32 Kilobyte  
records
sendbackup: time 3.199:  91:  normal(|):   DUMP: Estimated 5852872  
blocks (2857.

85MB) on 0.04 tapes.
sendbackup: time 3.245:  91:  normal(|):   DUMP: Dumping (Pass III)  
[directories

]
sendbackup: time 16237.077: index tee cannot write [Broken pipe]
sendbackup: time 16237.077: pid 14531 finish time Thu Aug  2 06:26:04  
2007



vReading conf file "/etc/opt/slis/amanda/amanda-client.conf".
amandad: time 0.005: security_getdriver(name=BSD) returns ff2dd95c
amandad: version 2.5.2
amandad: time 0.005: build: VERSION="Amanda-2.5.2"
amandad: time 0.005:BUILT_DATE="Thu May 24 22:05:16 EDT 2007"
amandad: time 0.005:CC="gcc"
amandad: time 0.005:CONFIGURE_COMMAND="'./configure' '-- 
prefix=/opt/slis
' '--with-user=amanda' '--with-group=amanda' '--with-libraries=/opt/ 
sfw/lib' '--
with-configdir=/etc/opt/slis/amanda' '--with-includes=/opt/sfw/ 
include' '--with-
krb5-security=/opt/slis/krb5' '--with-gnutar=/opt/slis/bin/tar' '-- 
with-portrang

e=5,50100'"
amandad: time 0.005: paths: bindir="/opt/slis/bin" sbindir="/opt/slis/ 
sbin"
amandad: time 0.005:libexecdir="/opt/slis/libexec" mandir="/ 
opt/slis/man

"
amandad: time 0.005:AMANDA_TMPDIR="/tmp/amanda"  
AMANDA_DBGDIR="/tmp/aman

da"
amandad: time 0.005:CONFIG_DIR="/etc/opt/slis/amanda"  
DEV_PREFIX="/dev/d

sk/"
amandad: time 0.005:RDEV_PREFIX="/dev/rdsk/" DUMP="/usr/sbin/ 
ufsdump"
amandad: time 0.005:RESTORE="/usr/sbin/ufsrestore"  
VDUMP=UNDEF VRESTORE=

UNDEF
amandad: time 0.005:XFSDUMP=UNDEF XFSRESTORE=UNDEF  
VXDUMP=UNDEF VXRESTOR

E=UNDEF
amandad: time 0.005:SAMBA_CLIENT="/usr/sfw/bin/smbclient"
amandad: time 0.005:GNUTAR="/opt/slis/bin/tar"  
COMPRESS_PATH="//bin/gzip

"
amandad: time 0.005:UNCOMPRESS_PATH="//bin/gzip" LPRCMD="// 
bin/lp"

amandad: time 0.005:MAILER="//bin/mailx"
amandad: time 0.005:listed_incr_dir="/var/opt/slis/amanda/ 
gnutar-lists"

amandad: time 0.005:DEFAULT_CONFIG="DailySet1"
amandad: time 0.005:NEED_STRSTR HAVE_SYSVSHM  
LOCKING=POSIX_FCNTL SETPGRP

_VOID
amandad: time 0.005:DEBUG_CODE AMANDA_DEBUG_DAYS=4  
BSD_SECURITY KRB5_SEC

URITY
amandad: time 0.005:RSH_SECURITY USE_AMANDAHOSTS  
CLIENT_LOGIN="amanda"

amandad: time 0.005:FORCE_USERID HAVE_GZIP COMPRESS_SUFFIX=".gz"
amandad: time 0.005:COMPRESS_FAST_OPT="

Re: sendbackup: index tee cannot write [Broken pipe]

2007-03-07 Thread Toralf Lund


A few days ago I had some backup problems that turned out to be caused 
by a hanging NFS mount, causing "sendsize" to lock up completely - see 
a separate post on this. Now I have sorted out this problem, and it 
seemed like amdump would once again start properly, but it turns out 
that the backup still doesn't run properly - this time it's 
"sendbackup" that gets into trouble. The full sendbackup...debug from 
a failed dump is included below. [ ... ]
Right. After looking a bit more at the logs etc. I'm no longer convinced 
that the error reported here is the main issue, and that it could be 
caused by a full disk - which I didn't think at first it might be, 
because I though I got the same behaviour for different configs with 
different holding space on different disks and I was convinced that they 
can't all have been filled up.


Anyhow, the thing is, I've been getting the following strange and 
somewhat annoying (because I just can't understand what "stranded on 
waitq" is supposed to mean) error message:



fileserv:/archive0 planner: [hmm, disk was 
stranded on waitq]


This turned out to be caused by various sendsize processes hanging 
because of NFS access problems. Or at least, so I thought, but after 
sorting out these issues, I still get the same message. So I looked at 
the logs again and noticed the "index tee cannot write", but maybe it's 
just a coincidence that this started occurring right now, and that the 
other message is caused by something else entirely?


- Toralf



sendbackup: index tee cannot write [Broken pipe]

2007-03-07 Thread Toralf Lund
A few days ago I had some backup problems that turned out to be caused 
by a hanging NFS mount, causing "sendsize" to lock up completely - see a 
separate post on this. Now I have sorted out this problem, and it seemed 
like amdump would once again start properly, but it turns out that the 
backup still doesn't run properly - this time it's "sendbackup" that 
gets into trouble. The full sendbackup...debug from a failed dump is 
included below.


Does anyone have any idea what may be causing this? What exactly is this 
"index tee" trying to write to (file, FIFO, socket...)? Is there 
anything extra that needs to be cleaned up after problems caused by 
hanging sendsize processes?



sendbackup: debug 1 pid 13900492 ruid 33 euid 33: start at Fri Mar  2 
16:58:31 2007

sendbackup: version 2.5.0p2
 parsed request as: program `GNUTAR'
disk `/scanner/golg'
device `/scanner/golg'
level 0
since 1970:1:1:0:0:0
options 
`|;auth=BSD;srvcomp-fast;index;exclude-list=.amanda.excludes;exclude-optional;'
sendbackup-gnutar: time 0.019: doing level 0 dump as listed-incremental 
to /usr/freeware/var/lib/amanda/gnutar-lists/fileserv_scanner_golg_0.new
sendbackup-gnutar: time 385956.158: doing level 0 dump from date: 
1970-01-01  0:00:00 GMT
sendbackup: time 385956.592: spawning /usr/freeware/libexec/runtar in 
pipeline
sendbackup: argument list: gtar --create --file - --directory 
/scanner/golg --one-file-system --listed-incremental 
/usr/freeware/var/lib/amanda/gnutar-lists/fileserv_scanner_golg_0.new 
--sparse --ignore-failed-read --totals --exclude-from 
/tmp/amanda/sendbackup._scanner_golg.20070307041107.exclude .
sendbackup: time 385956.603: started index creator: 
"/usr/freeware/bin/tar -tf - 2>/dev/null | sed -e 's/^\.//'"
sendbackup-gnutar: time 385956.605: /usr/freeware/libexec/runtar: pid 
14323097

sendbackup: time 385956.606: started backup
sendbackup: time 385969.818: index tee cannot write [Broken pipe]
sendbackup: time 385969.818: pid 14193336 finish time Wed Mar  7 
04:11:21 2007
sendbackup: time 385969.842: 117: strange(?): sendbackup: index tee 
cannot write [Broken pipe]

sendbackup: time 403598.554: error [/usr/freeware/bin/tar got signal 9]
sendbackup: time 403598.615: pid 13900492 finish time Wed Mar  7 
09:05:09 2007




Broken pipe on a big DLE

2006-08-24 Thread Toomas Aas

Hello!

I recently (this Sunday) upgraded my Amanda server to 2.5.0p2. Clients (other
than the server itself) are 2.4.4 and 2.4.5.

First two amdumps (Monday and Tuesday evenyng) ran fine, except that I forgot
to change the tape on Tuesday and had to run amflush on Wednesday. But the
Wednesday evening dump had two DLEs fail.

The reason of first one (tsensor.raad.tartu.ee /) I understand - it's caused
by a firewall on client not handling some TCP windowing stuff very well. This
has happened from time to time, even before upgrading the Amanda server, even
though the error message was a bit different.

The second one (heerold.raad.tartu.ee /home) is something new, however.
The situation with this DLE is that it is quite big and sometimes doesn't fit on
holding disk. With previous versions of Amanda it was started to dump on 
holdingdisk,
then the holdingdisk filled up, then there was a "broken pipe" message in 
syslog (but
not in Amanda report), then the DLE was restarted directly to tape and got 
backed up
successfully.

It seems that now, once the holdingdisk fills up, the DLE is not restarted 
directly
to tape, it just fails.

I think I might set "holdingdisk no" for this DLE since it often doesn't fit 
anyway,
but the behaviour of Amanda in this situation still doesn't seem quite correct 
to me.

 Amanda report 
Subject: TLV AMANDA MAIL REPORT FOR August 23, 2006
Date: Thu, 24 Aug 2006 03:32:58 +0300 (EEST)
From: [EMAIL PROTECTED] (System Operator)
To: [EMAIL PROTECTED]

*** THE DUMPS DID NOT FINISH PROPERLY!

These dumps were to tape PAEV33.
The next tape Amanda expects to use is: PAEV34.

FAILURE AND STRANGE DUMP SUMMARY:
  tsensor.raad.tartu.ee  /  lev 0  FAILED [cannot read header: got 0 
instead of 32768]
  heerold.raad.tartu.ee  /home  lev 0  FAILED [data write: Broken pipe]
  driver: FATAL Don't know how to send ABORT command to chunker
  chunker: FATAL error [bad command after RQ-MORE-DISK: "QUIT"]



(brought to you by Amanda version 2.5.0p2)

 Amanda report 

 sendbackup debug 
sendbackup: debug 1 pid 36024 ruid 1034 euid 1034: start at Thu Aug 24 01:48:29 
2006
/usr/local/libexec/amanda/sendbackup: version 2.4.4p4
  parsed request as: program `DUMP'
 disk `/home'
 device `/home'
 level 0
 since 1970:1:1:0:0:0
 options `|;auth=BSD;index;'
sendbackup: try_socksize: send buffer size is 65536
sendbackup: time 0.000: stream_server: waiting for connection: 0.0.0.0.11081
sendbackup: time 0.000: stream_server: waiting for connection: 0.0.0.0.11082
sendbackup: time 0.000: stream_server: waiting for connection: 0.0.0.0.11083
sendbackup: time 0.000: waiting for connect on 11081, then 11082, then 11083
sendbackup: time 0.016: stream_accept: connection from 194.126.106.100.11083
sendbackup: time 0.035: stream_accept: connection from 194.126.106.100.11084
sendbackup: time 0.061: stream_accept: connection from 194.126.106.100.11085
sendbackup: time 0.061: got all connections
sendbackup: time 0.062: dumping device '/dev/da0s1g' with 'ufs'
sendbackup: time 0.063: spawning /sbin/dump in pipeline
sendbackup: argument list: dump 0ushf 1048576 0 - /dev/da0s1g
sendbackup: time 0.065: started index creator: "/sbin/restore -tvf - 2>&1 | sed 
-e '
s/^leaf[]*[0-9]*[   ]*\.//
t
/^dir[  ]/ {
s/^dir[ ]*[0-9]*[   ]*\.//
s%$%/%
t
}
d
'"
sendbackup: time 0.068:  93:  normal(|):   DUMP: WARNING: should use -L when 
dumping live read-write filesystems!
sendbackup: time 0.068:  93:  normal(|):   DUMP: Date of this level 0 dump: Thu 
Aug 24 01:48:29 2006
sendbackup: time 0.069:  93:  normal(|):   DUMP: Date of last level 0 dump: the 
epoch
sendbackup: time 0.069:  93:  normal(|):   DUMP: Dumping /dev/da0s1g (/home) to 
standard output
sendbackup: time 0.767:  93:  normal(|):   DUMP: mapping (Pass I) [regular 
files]
sendbackup: time 2.676:  93:  normal(|):   DUMP: mapping (Pass II) [directories]
sendbackup: time 2.677:  93:  normal(|):   DUMP: estimated 17437291 tape blocks.
sendbackup: time 2.725:  93:  normal(|):   DUMP: dumping (Pass III) 
[directories]
sendbackup: time 26.967:  93:  normal(|):   DUMP: dumping (Pass IV) [regular 
files]
sendbackup: time 302.311:  93:  normal(|):   DUMP: 5.25% done, finished in 1:30 
at Thu Aug 24 03:23:45 2006
sendbackup: time 602.284:  93:  normal(|):   DUMP: 11.48% done, finished in 
1:17 at Thu Aug 24 03:15:36 2006
sendbackup: time 902.281:  93:  normal(|):   DUMP: 16.86% done, finished in 
1:13 at Thu Aug 24 03:17:28 2006
sendbackup: time 1202.285:  93:  normal(|):   DUMP: 22.56% done, finished in 
1:08 at Thu Aug 24 03:17:10 2006
sendbackup: time 1502.290:  93:  normal(|):   DUMP: 28.54% done, finished in 
1:02 at Thu Aug 24 03:16:07 2006
sendbackup: time 1802.283:  93:  normal(|):   DUMP: 

Re: broken pipe (WAS: Re: timeout while waiting for REP)

2006-07-06 Thread Paul Bijnens

Cameron Matheson schreef:


Well, Changing the etimeout/estimate-method in amanda.conf definitely
helped, but now I'm getting a broken-pipe error.  Here's the excerpt
from this morning's e-mail:

  aspapp2.tonservices.com   /opt/webapp/images  lev 0  FAILED [data
  timeout]
  aspapp2.tonservices.com   /opt/webapp/images  lev 0  FAILED [dump to
  tape failed]

  taper: no split_diskbuffer specified: using fallback split size of 10240kb to 
buffer aspapp2.tonservices.com:/opt/webapp/images.0 in-memory
  taper: tape DailySet1012 kb 59249888 fm 25 [OK]

That looks bad... I looked up what split_diskbuffer was in the
amanda.conf manpage, but I'm not sure how setting that to anything
different would help me (I think I'll just let it use its default until
I understand it better).  


When Amanda writes to tape, and it bumps into end of tape, it needs
to rewrite the whole chunk on the next tape  (because you cannot be
sure up to which byte did end up on tape, unless you write a filemark).

The "tape_splitsize" indicates how large such a tapechunk ending with
a filemark will be.  In the worst case Amanda will loose that
amount of tapecapacity at the end of a tape.

That chunk needs to be buffered somewhere.  If you're not using a 
holding disk, which would contain the complete dump file, then

Amanda can use some diskspace to buffer that chunk: some file in
the directory which you specify with "split_diskbuffer".

As a last resort it buffers the data in memory, in which case the tape
chunks now are only 10 Mbyte by default.

A good choice for directory as "split_diskbuffer" is your holdingdisk
directory.

And if your machine has plenty of RAM, then increase the 10 Mbyte 
"fallback_splitsize" too.


One remark:  Amanda reads from the diskbuffer with a mmap() as large
as the "tape_splitsize" value.  This means that you are limited with
your virtual memory how large that value can be.  If you specify it
too large, than Amanda will fall back to the fallback_splitsize, even
you have have 100 Gbyte free holdingdiskspace.



sendbackup: time 0.080: started backup
sendbackup: time 33267.859: index tee cannot write [Broken pipe]
sendbackup: time 33267.866: pid 15806 finish time Thu Jul  6 11:31:56
2006
sendbackup: time 33267.880: 117: strange(?): sendbackup: index tee
cannot write [Broken pipe]

[...]

It takes 9 hours, so I can see why that might time-out, but I'm not sure
why the pipe is closed... isn't the output from tar going over the
network the entire time?  How do I prevent this from occurring?



Could it be yet another symptom of this problem here:

http://wiki.zmanda.com/index.php/Amdump:_mesg_read:_Connection_reset_by_peer

Try this and see if it helps:

  echo 180 > /proc/sys/net/ipv4/tcp_keepalive_time


--
Paul Bijnens, XplanationTel  +32 16 397.511
Technologielaan 21 bus 2, B-3001 Leuven, BELGIUMFax  +32 16 397.512
http://www.xplanation.com/  email:  [EMAIL PROTECTED]
***
* I think I've got the hang of it now:  exit, ^D, ^C, ^\, ^Z, ^Q, F6, *
* quit,  ZZ, :q, :q!,  M-Z, ^X^C,  logoff, logout, close, bye,  /bye, *
* stop, end, F3, ~., ^]c, +++ ATH, disconnect, halt,  abort,  hangup, *
* PF4, F20, ^X^X, :D::D, KJOB, F14-f-e, F8-e,  kill -1 $$,  shutdown, *
* kill -9 1,  Alt-F4,  Ctrl-Alt-Del,  AltGr-NumLock,  Stop-A,  ...*
* ...  "Are you sure?"  ...   YES   ...   Phew ...   I'm out  *
***


broken pipe (WAS: Re: timeout while waiting for REP)

2006-07-06 Thread Cameron Matheson
Hi, I'm back:

On Wed, Jul 05, 2006 at 04:56:43PM -0600, Cameron Matheson wrote:
> On Thu, Jul 06, 2006 at 12:36:43AM +0200, Peter Kunst wrote:
> > try using a larger number (in seconds) for "etimeout" in your amanda.conf
> > 
> > try "estimate server" within the dumptype used for that DLE (which is 
> > something new since, let me guess, can't remember, 2.5.x ?)
> 
> Thanks, I had never changed the default 300 seconds which is no where
> near long enough for this directory (I would need more like an hour).
> That estimate server thing looks like just the ticket too (spending an
> hour each night estimating would not be ideal).

Well, Changing the etimeout/estimate-method in amanda.conf definitely
helped, but now I'm getting a broken-pipe error.  Here's the excerpt
from this morning's e-mail:

  aspapp2.tonservices.com   /opt/webapp/images  lev 0  FAILED [data
  timeout]
  aspapp2.tonservices.com   /opt/webapp/images  lev 0  FAILED [dump to
  tape failed]

  taper: no split_diskbuffer specified: using fallback split size of 10240kb to 
buffer aspapp2.tonservices.com:/opt/webapp/images.0 in-memory
  taper: tape DailySet1012 kb 59249888 fm 25 [OK]

That looks bad... I looked up what split_diskbuffer was in the
amanda.conf manpage, but I'm not sure how setting that to anything
different would help me (I think I'll just let it use its default until
I understand it better).  Anyway, Here's the excerpt from the sendbackup
file on my client (Sorry, this is a little long):

sendbackup-gnutar: time 0.000: doing level 0 dump as listed-incremental
to
/usr/local/var/amanda/gnutar-lists/aspapp2.tonservices.com_opt_webapp_images_0.new
sendbackup-gnutar: time 0.076: doing level 0 dump from date: 1970-01-01
0:00:00 GMT
sendbackup: time 0.079: started index creator: "/bin/gtar -tf -
2>/dev/null | sed -e 's/^\.//'"
sendbackup: time 0.079: spawning /usr/local/libexec/runtar in pipeline
sendbackup: argument list: gtar --create --file - --directory
/opt/webapp/images --one-file-system --listed-incremental
/usr/local/var/amanda/gnutar-lists/aspapp2.tonservices.com_opt_webapp_images_0.new
--sparse --ignore-failed-read --totals .
sendbackup-gnutar: time 0.080: /usr/local/libexec/runtar: pid 15808
sendbackup: time 0.080: started backup
sendbackup: time 33267.859: index tee cannot write [Broken pipe]
sendbackup: time 33267.866: pid 15806 finish time Thu Jul  6 11:31:56
2006
sendbackup: time 33267.880: 117: strange(?): sendbackup: index tee
cannot write [Broken pipe]
sendbackup: time 33267.880: 117: strange(?): sed: couldn't flush stdout:
Broken pipe
sendbackup: time 33268.445:  46:    size(|): Total bytes written: 20480
(20KiB, ?/s)
sendbackup: time 33268.445: 117: strange(?): gtar: -: Cannot write:
Broken pipe
sendbackup: time 33268.445: 117: strange(?): gtar: Error is not
recoverable: exiting now
sendbackup: time 33268.633: error [/bin/gtar returned 2]
sendbackup: time 33268.633: pid 15804 finish time Thu Jul  6 11:31:57
2006

It takes 9 hours, so I can see why that might time-out, but I'm not sure
why the pipe is closed... isn't the output from tar going over the
network the entire time?  How do I prevent this from occurring?

Thanks again,
Cameron Matheson


Re: [out of tape] [dump to tape failed] ["data write: Broken pipe"]

2004-09-01 Thread Reidar Nordin
Thank's a lot.. u put me on the right track...
After running dd as u said I knew what the problem was when it stopped at 2GB  
What I did was to change filesystem on the disk from Fat32 to EXT3... wich accepts a 
bigger filesize...

\\Reidar



 

- Original Message - 
From: "Frank Smith" <[EMAIL PROTECTED]>
To: "Reidar Nordin" <[EMAIL PROTECTED]>; <[EMAIL PROTECTED]>
Sent: Monday, August 30, 2004 2:33 AM
Subject: Re: [out of tape] [dump to tape failed] ["data write: Broken pipe"] 


> --On Sunday, August 29, 2004 15:17:50 +0200 Reidar Nordin <[EMAIL PROTECTED]> wrote:
> 
> > 
> >> Your 'tape' is full.  It doesn't necessarily mean the disk
> >> is full, it could be that your backups exceed your tape length.
> >> Look farther down in your report for the taper line and see if it
> >> wrote as many bytes as you specified for the length in your tapetype.
> >> If so, either increase your tape length (since you're really writing
> >> to a file instead of to a tape) or increase runtapes to write to
> >> multiple 'tapes'.
> > 
> > I have increased the length in tapetype from 10GB to 20GB but still got the same 
> > error as desc. before
> >  
> > The dumpsummary for just this server (awe) says:
> > ***
> > 
> >DUMPER STATSTAPER STATS 
> > HOSTNAME DISKL ORIG-KB OUT-KB COMP% MMM:SS  KB/s MMM:SS  KB/s
> > awe  /home   0 FAILED ---
> > awe  /httpd/conf 0 200 55  27.5   0:00 272.6   0:00 240.5
> > awe  -/local/adm 0  30  2   6.7   0:00   5.0   0:00   7.0
> > awe  -/lib/mysql 0  184160  22300  12.1   1:19 282.1   1:19 282.0
> > ***
> > 
> > From the end of my amanda log:
> > ***
> > INFO taper tape backup09 kb 2188064 fm 30 writing file: short write
> 
> Looks like it died around 2 GB.
> 
> > FAIL taper awe /home 20040828 0 [out of tape]
> > ERROR taper no-tape [[writing file: short write]]
> > FAIL driver awe /home 20040828 0 [dump to tape failed]
> > FAIL dumper awe /home 20040828 0 ["data write: Broken pipe"]
> >   sendbackup: start [awe:/home level 0]
> >   sendbackup: info BACKUP=/bin/tar
> >   sendbackup: info RECOVER_CMD=/usr/bin/gzip -dc |/bin/tar -f... -
> >   sendbackup: info COMPRESS_SUFFIX=.gz
> >   sendbackup: info end
> > FINISH driver date 20040828 time 2188.326
> > ***
> >  
> >> I'm guessing you're not using a holding disk.  While it's not as
> >> necessary using the file driver, it would allow multiple dumps to
> >> occur in parallel, which can shorten your backup window (using a
> >> different disk is recommended to lessen I/O contention)
> > 
> > I am using hdb1 for my backup's (mounted FAT32 disk)
> 
> I think this is your problem, I don't think FAT32 can't handle files
> greater than 4 GB (twice what you are erroring out at, but much smaller
> than your configured 'tape' size), but I think smbmount may have a 2 GB
> limit.
> 
> Can you create a bigger file using dd ?
> dd if=/dev/zero of=/mounted/fat32disk/testfile bs=1024 count=500
> 
> should create a 5 GB file in the path you specify with the of=, if it
> bombs at 2 GB then look into how it is mounted (could be samba or kernel
> problem).  If you need files bigger than 4 GB on windows it needs to
> be NTFS and not fat32.
> 
> > I am using hda2 as holding disk, this is my config in amanda.conf:
> > 
> > ***
> > holdingdisk hda2 {
> > comment "main holding disk"
> > directory "/usr/local/etc/amanda/holdingdisk"   # where the holding disk is
> > use -10 Mb  # how much space can we use on it
> > # a non-positive value means:
> > #use all space but that value
> > chunksize 10Gb  # size of chunk if you want big dump to be
> > # dumped on multiple files on holding disks
> > #  N Kb/Mb/Gb split images in chunks of size N
> > # The maximum value should be
> > # (MAX_FILE_SIZE - 1Mb)
> > #  0  same as INT_MAX bytes
> > }
> > ***
> > 
> > But the holdingdisk seems to be empty when Amanda tells me that some files may 
> > have been left in the holdingsdisk and that I should run amflush to flush them to 
> > tape.
> 
> Your backup was going direct to tape (FAIL driver awe /home 20040828 0 [dump to tape 
> failed]) so there was nothing in the holding disk to flush.  The 'flush to
> tape' message is always output on a failed backup whether there is anythng to
> flush or not (it probably should check, but it doesn't).
>Perhaps your reserve parameter is set to not allow level 0s to holdingdisk.
> 
> Frank
> 
> > 
> > \\Reidar
> > 
> >   
> >
> > 
> > 
> 
> 
> 
> 
> 





Re: [out of tape] [dump to tape failed] ["data write: Broken pipe"]

2004-08-29 Thread Frank Smith
--On Sunday, August 29, 2004 15:17:50 +0200 Reidar Nordin <[EMAIL PROTECTED]> wrote:

> 
>> Your 'tape' is full.  It doesn't necessarily mean the disk
>> is full, it could be that your backups exceed your tape length.
>> Look farther down in your report for the taper line and see if it
>> wrote as many bytes as you specified for the length in your tapetype.
>> If so, either increase your tape length (since you're really writing
>> to a file instead of to a tape) or increase runtapes to write to
>> multiple 'tapes'.
> 
> I have increased the length in tapetype from 10GB to 20GB but still got the same 
> error as desc. before
>  
> The dumpsummary for just this server (awe) says:
> ***
> 
>DUMPER STATSTAPER STATS 
> HOSTNAME DISKL ORIG-KB OUT-KB COMP% MMM:SS  KB/s MMM:SS  KB/s
> awe  /home   0 FAILED ---
> awe  /httpd/conf 0 200 55  27.5   0:00 272.6   0:00 240.5
> awe  -/local/adm 0  30  2   6.7   0:00   5.0   0:00   7.0
> awe  -/lib/mysql 0  184160  22300  12.1   1:19 282.1   1:19 282.0
> ***
> 
> From the end of my amanda log:
> ***
> INFO taper tape backup09 kb 2188064 fm 30 writing file: short write

Looks like it died around 2 GB.

> FAIL taper awe /home 20040828 0 [out of tape]
> ERROR taper no-tape [[writing file: short write]]
> FAIL driver awe /home 20040828 0 [dump to tape failed]
> FAIL dumper awe /home 20040828 0 ["data write: Broken pipe"]
>   sendbackup: start [awe:/home level 0]
>   sendbackup: info BACKUP=/bin/tar
>   sendbackup: info RECOVER_CMD=/usr/bin/gzip -dc |/bin/tar -f... -
>   sendbackup: info COMPRESS_SUFFIX=.gz
>   sendbackup: info end
> FINISH driver date 20040828 time 2188.326
> ***
>  
>> I'm guessing you're not using a holding disk.  While it's not as
>> necessary using the file driver, it would allow multiple dumps to
>> occur in parallel, which can shorten your backup window (using a
>> different disk is recommended to lessen I/O contention)
> 
> I am using hdb1 for my backup's (mounted FAT32 disk)

I think this is your problem, I don't think FAT32 can't handle files
greater than 4 GB (twice what you are erroring out at, but much smaller
than your configured 'tape' size), but I think smbmount may have a 2 GB
limit.

Can you create a bigger file using dd ?
dd if=/dev/zero of=/mounted/fat32disk/testfile bs=1024 count=500

should create a 5 GB file in the path you specify with the of=, if it
bombs at 2 GB then look into how it is mounted (could be samba or kernel
problem).  If you need files bigger than 4 GB on windows it needs to
be NTFS and not fat32.

> I am using hda2 as holding disk, this is my config in amanda.conf:
> 
> ***
> holdingdisk hda2 {
> comment "main holding disk"
> directory "/usr/local/etc/amanda/holdingdisk"   # where the holding disk is
> use -10 Mb  # how much space can we use on it
> # a non-positive value means:
> #use all space but that value
> chunksize 10Gb  # size of chunk if you want big dump to be
> # dumped on multiple files on holding disks
> #  N Kb/Mb/Gb split images in chunks of size N
> # The maximum value should be
> # (MAX_FILE_SIZE - 1Mb)
> #  0  same as INT_MAX bytes
> }
> ***
> 
> But the holdingdisk seems to be empty when Amanda tells me that some files may have 
> been left in the holdingsdisk and that I should run amflush to flush them to tape.

Your backup was going direct to tape (FAIL driver awe /home 20040828 0 [dump to tape 
failed]) so there was nothing in the holding disk to flush.  The 'flush to
tape' message is always output on a failed backup whether there is anythng to
flush or not (it probably should check, but it doesn't).
   Perhaps your reserve parameter is set to not allow level 0s to holdingdisk.

Frank

> 
> \\Reidar
> 
>   
>
> 
> 






Re: [out of tape] [dump to tape failed] ["data write: Broken pipe"]

2004-08-29 Thread Jon LaBadie
On Sun, Aug 29, 2004 at 03:17:50PM +0200, Reidar Nordin wrote:
> 
> > Your 'tape' is full.  It doesn't necessarily mean the disk
> > is full, it could be that your backups exceed your tape length.
> > Look farther down in your report for the taper line and see if it
> > wrote as many bytes as you specified for the length in your tapetype.
> > If so, either increase your tape length (since you're really writing
> > to a file instead of to a tape) or increase runtapes to write to
> > multiple 'tapes'.
> 
> I have increased the length in tapetype from 10GB to 20GB but still got the same 
> error as desc. before
>  
> The dumpsummary for just this server (awe) says:
> ***
> 
>DUMPER STATSTAPER STATS 
> HOSTNAME DISKL ORIG-KB OUT-KB COMP% MMM:SS  KB/s MMM:SS  KB/s
> awe  /home   0 FAILED ---
> awe  /httpd/conf 0 200 55  27.5   0:00 272.6   0:00 240.5
> awe  -/local/adm 0  30  2   6.7   0:00   5.0   0:00   7.0
> awe  -/lib/mysql 0  184160  22300  12.1   1:19 282.1   1:19 282.0
> ***
> 
> >From the end of my amanda log:
> ***
> INFO taper tape backup09 kb 2188064 fm 30 writing file: short write
> FAIL taper awe /home 20040828 0 [out of tape]
> ERROR taper no-tape [[writing file: short write]]
> FAIL driver awe /home 20040828 0 [dump to tape failed]
> FAIL dumper awe /home 20040828 0 ["data write: Broken pipe"]
>   sendbackup: start [awe:/home level 0]
>   sendbackup: info BACKUP=/bin/tar
>   sendbackup: info RECOVER_CMD=/usr/bin/gzip -dc |/bin/tar -f... -
>   sendbackup: info COMPRESS_SUFFIX=.gz
>   sendbackup: info end
> FINISH driver date 20040828 time 2188.326
> ***
>  
> > I'm guessing you're not using a holding disk.  While it's not as
> > necessary using the file driver, it would allow multiple dumps to
> > occur in parallel, which can shorten your backup window (using a
> > different disk is recommended to lessen I/O contention)
> 
> I am using hdb1 for my backup's (mounted FAT32 disk)
> I am using hda2 as holding disk, this is my config in amanda.conf:
> 
> ***
> holdingdisk hda2 {
> comment "main holding disk"
> directory "/usr/local/etc/amanda/holdingdisk"   # where the holding disk is
> use -10 Mb  # how much space can we use on it
> # a non-positive value means:
> #use all space but that value
> chunksize 10Gb  # size of chunk if you want big dump to be
> # dumped on multiple files on holding disks
> #  N Kb/Mb/Gb split images in chunks of size N
> # The maximum value should be
> # (MAX_FILE_SIZE - 1Mb)
> #  0  same as INT_MAX bytes
> }
> ***
> 
> But the holdingdisk seems to be empty when Amanda tells me that some files may have 
> been left in the holdingsdisk and that I should run amflush to flush them to tape.
> 
> \\Reidar
> 

I don't know the max file size on your FS's, but on some it is 2GB.
You could eliminate that as a possible problem on the holding disk
by specifying a "chunksize" as 1GB.   I don't know about the FAT32
you are using for your virtual 20GB tapes.  Is there a file size
limit on FAT32?

-- 
Jon H. LaBadie  [EMAIL PROTECTED]
 JG Computing
 4455 Province Line Road(609) 252-0159
 Princeton, NJ  08540-4322  (609) 683-7220 (fax)


Re: [out of tape] [dump to tape failed] ["data write: Broken pipe"]

2004-08-29 Thread Reidar Nordin

> Your 'tape' is full.  It doesn't necessarily mean the disk
> is full, it could be that your backups exceed your tape length.
> Look farther down in your report for the taper line and see if it
> wrote as many bytes as you specified for the length in your tapetype.
> If so, either increase your tape length (since you're really writing
> to a file instead of to a tape) or increase runtapes to write to
> multiple 'tapes'.

I have increased the length in tapetype from 10GB to 20GB but still got the same error 
as desc. before
 
The dumpsummary for just this server (awe) says:
***

   DUMPER STATSTAPER STATS 
HOSTNAME DISKL ORIG-KB OUT-KB COMP% MMM:SS  KB/s MMM:SS  KB/s
awe  /home   0 FAILED ---
awe  /httpd/conf 0 200 55  27.5   0:00 272.6   0:00 240.5
awe  -/local/adm 0  30  2   6.7   0:00   5.0   0:00   7.0
awe  -/lib/mysql 0  184160  22300  12.1   1:19 282.1   1:19 282.0
***

>From the end of my amanda log:
***
INFO taper tape backup09 kb 2188064 fm 30 writing file: short write
FAIL taper awe /home 20040828 0 [out of tape]
ERROR taper no-tape [[writing file: short write]]
FAIL driver awe /home 20040828 0 [dump to tape failed]
FAIL dumper awe /home 20040828 0 ["data write: Broken pipe"]
  sendbackup: start [awe:/home level 0]
  sendbackup: info BACKUP=/bin/tar
  sendbackup: info RECOVER_CMD=/usr/bin/gzip -dc |/bin/tar -f... -
  sendbackup: info COMPRESS_SUFFIX=.gz
  sendbackup: info end
FINISH driver date 20040828 time 2188.326
***
 
> I'm guessing you're not using a holding disk.  While it's not as
> necessary using the file driver, it would allow multiple dumps to
> occur in parallel, which can shorten your backup window (using a
> different disk is recommended to lessen I/O contention)

I am using hdb1 for my backup's (mounted FAT32 disk)
I am using hda2 as holding disk, this is my config in amanda.conf:

***
holdingdisk hda2 {
comment "main holding disk"
directory "/usr/local/etc/amanda/holdingdisk"   # where the holding disk is
use -10 Mb  # how much space can we use on it
# a non-positive value means:
#use all space but that value
chunksize 10Gb  # size of chunk if you want big dump to be
# dumped on multiple files on holding disks
#  N Kb/Mb/Gb split images in chunks of size N
# The maximum value should be
# (MAX_FILE_SIZE - 1Mb)
#  0  same as INT_MAX bytes
}
***

But the holdingdisk seems to be empty when Amanda tells me that some files may have 
been left in the holdingsdisk and that I should run amflush to flush them to tape.

\\Reidar

  
   






Re: [out of tape] [dump to tape failed] ["data write: Broken pipe"]

2004-08-29 Thread Frank Smith
--On Sunday, August 29, 2004 09:08:24 +0200 Reidar Nordin <[EMAIL PROTECTED]> wrote:

> I am running amanda in a redhat based network. I am backing up to disk
> instead of to tape because I had to many errors with the tapes.
> When I first started to back up to tape it worked fine but after a while
> i started getting error messages for just one server.
> I am backing up 7 servers and they all work fine except for one of them,
> the server that gives me the error is (awe), I get this error message
> after every run (the disk is not full):
>  
> These dumps were to tape backup09.
> *** A TAPE ERROR OCCURRED: [[writing file: short write]].
> Some dumps may have been left in the holding disk.
> Run amflush to flush them to tape.
> The next tape Amanda expects to use is: backup010.
> 
> FAILURE AND STRANGE DUMP SUMMARY:
>   awe/home lev 0 FAILED [out of tape]

Your 'tape' is full.  It doesn't necessarily mean the disk
is full, it could be that your backups exceed your tape length.
Look farther down in your report for the taper line and see if it
wrote as many bytes as you specified for the length in your tapetype.
If so, either increase your tape length (since you're really writing
to a file instead of to a tape) or increase runtapes to write to
multiple 'tapes'.

>   awe/home lev 0 FAILED [dump to tape failed]

I'm guessing you're not using a holding disk.  While it's not as
necessary using the file driver, it would allow multiple dumps to
occur in parallel, which can shorten your backup window (using a
different disk is recommended to lessen I/O contention)

Frank

>   awe/home lev 0 FAILED ["data write: Broken pipe"]
>  
>  
> /-- awe/home lev 0 FAILED ["data write: Broken pipe"]
> sendbackup: start [awe:/home level 0]
> sendbackup: info BACKUP=/bin/tar
> sendbackup: info RECOVER_CMD=/usr/bin/gzip -dc |/bin/tar -f... -
> sendbackup: info COMPRESS_SUFFIX=.gz
> sendbackup: info end
> \
>  
>  
> Anyone got a clue  
>  
>  
>  
> \\Reidar   
>  
>  
>  
>  
>  
>  
>  






[out of tape] [dump to tape failed] ["data write: Broken pipe"]

2004-08-29 Thread Reidar Nordin



I am running amanda in a redhat based network. I am 
backing up to disk instead of to tape because I had to many errors with the 
tapes.
When I first started to back up to tape it worked 
fine but after a while i started getting error messages for just one 
server.
I am backing up 7 servers and they all work fine 
except for one of them, the server that gives me the error is (awe), I get this 
error message after every run (the disk is not full):
 
These dumps were to tape backup09.*** A TAPE ERROR OCCURRED: [[writing 
file: short write]].Some dumps may have been left in the holding 
disk.Run amflush to flush them to tape.The next tape Amanda expects to 
use is: backup010.FAILURE AND STRANGE DUMP SUMMARY:
  awe    /home lev 0 FAILED 
[out of tape]  awe    /home lev 
0 FAILED [dump to tape failed]  
awe    /home lev 0 FAILED ["data write: 
Broken pipe"]
 
 
/-- awe    /home lev 0 FAILED ["data 
write: Broken pipe"]sendbackup: start [awe:/home level 0]sendbackup: 
info BACKUP=/bin/tarsendbackup: info RECOVER_CMD=/usr/bin/gzip -dc |/bin/tar 
-f... -sendbackup: info COMPRESS_SUFFIX=.gzsendbackup: info 
end\
 
 
Anyone got a clue  
 
 
 
\\Reidar 

 
 
 
 
 
 
 


RE: sendbackup: index tee cannot write [Broken pipe], Why?

2003-11-05 Thread Martinez, Michael
Wish I knew why it fixed it. In fact, when I added the holding-disk
stuff to disklist, it wasn't because I was trying to fix the index tee
problem, per se, it was simply because I noticed I had forgotten it. 

Then, subsequently noticed I had no more "broken tee" errors ...

Regards,
 
Michael Martinez
ISTM/CSREES
United States Department of Agriculture
---
This email is signed with my digital signature so that you may verify
the authenticity of the sender.

--> -Original Message-
--> From: Paul Bijnens [mailto:[EMAIL PROTECTED] 
--> Sent: Wednesday, November 05, 2003 10:19 AM
--> To: Martinez, Michael
--> Cc: [EMAIL PROTECTED]
--> Subject: Re: sendbackup: index tee cannot write [Broken pipe], Why?
--> 
--> 
--> Martinez, Michael wrote:
--> 
--> > I had this problem and fixed it by specifying 
--> "holding-disk -1 local" in
--> > disklist for the partition holding ~amanda on the tape server, and
--> > specifying "-1 local" for the rest of the tape server partitions.
-->  >
--> > --> John Grover wrote:
--> > --> 
--> > --> >   ? sendbackup: index tee cannot write [Broken pipe]
--> > --> >   ? index returned 1
--> > --> >   sendbackup: error [/usr/bin/tar got signal 13]
--> 
--> 
--> Clarifying -- I guess you had DLE's like:
--> 
--> host.domain   /amandaholding-disk   -1 local
--> host.domain   /  comp-user-tar  -1 local
--> host.domain   /home  comp-user-tar  -1 local
--> 
--> And with "comp-user-tar" instead of "holding-disk" on /amanda you
--> would get the above error message.
--> 
--> "Holding-disk" results in backup to tape immediatly, bypassing
--> the holdingdisk.  Any idea how that could influence the index tee?
--> Did the ~amanda also contain the holdingdisk? If yes, then it's
--> obvious you had to specify it, otherwise tar could create a huge
--> (infinite) backup image.  Eventually you would run out of 
--> holdingdisk
--> space.
--> 
--> Could it be triggered by the following sequence:
--> Tar is reading some huge files, maybe compressing them too.  This
--> takes a long time; in the meanwhile the "index" tee times out
--> in the server because the accompying index is only a few lines, and
--> is still buffered in the client.  But AFAIK there is no timeout
--> on the index-tee, see dumper.c, line 1260 etc.
--> Or there you should find some errors about "dup2", just 
--> before execlp
--> the "gzip --best" for the index writer, or the gzip --best
--> on the server crashed (how would you find out about this? there is
--> no shell to log such a message).  It could also be an output
--> error in "gzip --best" because your disk got full.  But then
--> you should have found an message in the logs (or maybe not, because
--> your disk was full :-) ).
--> 
--> Just trying to understand -- maybe we found some obscure bug.
--> 
--> 
--> -- 
--> Paul Bijnens, XplanationTel  
--> +32 16 397.511
--> Technologielaan 21 bus 2, B-3001 Leuven, BELGIUMFax  
--> +32 16 397.512
--> http://www.xplanation.com/  email:  
--> [EMAIL PROTECTED]
--> 
--> ***
--> * I think I've got the hang of it now:  exit, ^D, ^C, ^\, 
--> ^Z, ^Q, F6, *
--> * quit,  ZZ, :q, :q!,  M-Z, ^X^C,  logoff, logout, close, 
--> bye,  /bye, *
--> * stop, end, F3, ~., ^]c, +++ ATH, disconnect, halt,  
--> abort,  hangup, *
--> * PF4, F20, ^X^X, :D::D, KJOB, F14-f-e, F8-e,  kill -1 $$,  
--> shutdown, *
--> * kill -9 1,  Alt-F4,  Ctrl-Alt-Del,  AltGr-NumLock,  
--> Stop-A,  ...*
--> * ...  "Are you sure?"  ...   YES   ...   Phew ...   I'm 
--> out  *
--> 
--> ***
--> 
--> 
--> 


smime.p7s
Description: S/MIME cryptographic signature


Re: sendbackup: index tee cannot write [Broken pipe], Why?

2003-11-05 Thread Paul Bijnens
Martinez, Michael wrote:

I had this problem and fixed it by specifying "holding-disk -1 local" in
disklist for the partition holding ~amanda on the tape server, and
specifying "-1 local" for the rest of the tape server partitions.
>
--> John Grover wrote:
--> 
--> >   ? sendbackup: index tee cannot write [Broken pipe]
--> >   ? index returned 1
--> >   sendbackup: error [/usr/bin/tar got signal 13]


Clarifying -- I guess you had DLE's like:

host.domain   /amandaholding-disk   -1 local
host.domain   /  comp-user-tar  -1 local
host.domain   /home  comp-user-tar  -1 local
And with "comp-user-tar" instead of "holding-disk" on /amanda you
would get the above error message.
"Holding-disk" results in backup to tape immediatly, bypassing
the holdingdisk.  Any idea how that could influence the index tee?
Did the ~amanda also contain the holdingdisk? If yes, then it's
obvious you had to specify it, otherwise tar could create a huge
(infinite) backup image.  Eventually you would run out of holdingdisk
space.
Could it be triggered by the following sequence:
Tar is reading some huge files, maybe compressing them too.  This
takes a long time; in the meanwhile the "index" tee times out
in the server because the accompying index is only a few lines, and
is still buffered in the client.  But AFAIK there is no timeout
on the index-tee, see dumper.c, line 1260 etc.
Or there you should find some errors about "dup2", just before execlp
the "gzip --best" for the index writer, or the gzip --best
on the server crashed (how would you find out about this? there is
no shell to log such a message).  It could also be an output
error in "gzip --best" because your disk got full.  But then
you should have found an message in the logs (or maybe not, because
your disk was full :-) ).
Just trying to understand -- maybe we found some obscure bug.

--
Paul Bijnens, XplanationTel  +32 16 397.511
Technologielaan 21 bus 2, B-3001 Leuven, BELGIUMFax  +32 16 397.512
http://www.xplanation.com/  email:  [EMAIL PROTECTED]
***
* I think I've got the hang of it now:  exit, ^D, ^C, ^\, ^Z, ^Q, F6, *
* quit,  ZZ, :q, :q!,  M-Z, ^X^C,  logoff, logout, close, bye,  /bye, *
* stop, end, F3, ~., ^]c, +++ ATH, disconnect, halt,  abort,  hangup, *
* PF4, F20, ^X^X, :D::D, KJOB, F14-f-e, F8-e,  kill -1 $$,  shutdown, *
* kill -9 1,  Alt-F4,  Ctrl-Alt-Del,  AltGr-NumLock,  Stop-A,  ...*
* ...  "Are you sure?"  ...   YES   ...   Phew ...   I'm out  *
***



RE: sendbackup: index tee cannot write [Broken pipe], Why?

2003-11-05 Thread Martinez, Michael
I had this problem and fixed it by specifying "holding-disk -1 local" in
disklist for the partition holding ~amanda on the tape server, and
specifying "-1 local" for the rest of the tape server partitions.

Regards,
 
Michael Martinez
ISTM/CSREES
United States Department of Agriculture
---
This email is signed with my digital signature so that you may verify
the authenticity of the sender.

--> -Original Message-
--> From: Paul Bijnens [mailto:[EMAIL PROTECTED] 
--> Sent: Monday, November 03, 2003 11:17 AM
--> To: John Grover
--> Cc: [EMAIL PROTECTED]
--> Subject: Re: sendbackup: index tee cannot write [Broken pipe], Why?
--> 
--> 
--> John Grover wrote:
--> 
--> > I'm guessing that the timeout is caused by the failure of 
--> tar to write
--> > to the broken pipe, but the logs don't give me much to go on. I've
--> > scoured every lat resource I can find and still get these 
--> results. I'd
--> > be happy to send logs to anyone who would care to help me out with
--> > this.
--> 
--> 
--> >   ? sendbackup: index tee cannot write [Broken pipe]
--> >   ? index returned 1
--> >   sendbackup: error [/usr/bin/tar got signal 13]
--> 
--> This problem is signalled now and then on this list. People 
--> give lots of
--> tips to help/investigate/fix, but nobody ever came back and 
--> told what
--> he/she did to solve it.  If it ever got solved at all.
--> 
--> It seems it is the index-pipe on the server that is failing with
--> an (unknown) error.  This triggers a broken index pipe on 
--> the client.
--> We're not sure about that either.  That's why I suggested to
--> (temporarily) run without index for this DLE, and see if it gets any
--> better.
--> 
--> If this works fine, they you could try to create the index manually,
--> and see what's wrong, using commands on the server:
--> amrestore -p ... | gtar -tf -  > /tmp/the_index
--> Then with "gtar" running on the client:
--> ssh -l amanda amanda_server amrestore -p ... | gtar -tf 
--> - > /tmp/x
--> 
--> If you gzip the resulting index file and put in the correct 
--> place, with
--> the correct name, you have an index for amrecover too.
--> 
--> 
--> Also verify the gnutar version (on the client!);  best is 1.13.25 
--> (1.13.19 is probably ok too, but if you have it, I advice 
--> to upgrade 
--> anyway).  All other versions are suspicious.
--> Maybe the problem is with gzip (index files are ALWAYS compressed 
--> --best).  Verify that version too.
--> 
--> 
--> -- 
--> Paul Bijnens, XplanationTel  
--> +32 16 397.511
--> Technologielaan 21 bus 2, B-3001 Leuven, BELGIUMFax  
--> +32 16 397.512
--> http://www.xplanation.com/  email:  
--> [EMAIL PROTECTED]
--> 
--> ***
--> * I think I've got the hang of it now:  exit, ^D, ^C, ^\, 
--> ^Z, ^Q, F6, *
--> * quit,  ZZ, :q, :q!,  M-Z, ^X^C,  logoff, logout, close, 
--> bye,  /bye, *
--> * stop, end, F3, ~., ^]c, +++ ATH, disconnect, halt,  
--> abort,  hangup, *
--> * PF4, F20, ^X^X, :D::D, KJOB, F14-f-e, F8-e,  kill -1 $$,  
--> shutdown, *
--> * kill -9 1,  Alt-F4,  Ctrl-Alt-Del,  AltGr-NumLock,  
--> Stop-A,  ...*
--> * ...  "Are you sure?"  ...   YES   ...   Phew ...   I'm 
--> out  *
--> 
--> ***
--> 
--> 
--> 


smime.p7s
Description: S/MIME cryptographic signature


Re: sendbackup: index tee cannot write [Broken pipe], Why?

2003-11-03 Thread Paul Bijnens
John Grover wrote:

I'm guessing that the timeout is caused by the failure of tar to write
to the broken pipe, but the logs don't give me much to go on. I've
scoured every lat resource I can find and still get these results. I'd
be happy to send logs to anyone who would care to help me out with
this.


  ? sendbackup: index tee cannot write [Broken pipe]
  ? index returned 1
  sendbackup: error [/usr/bin/tar got signal 13]
This problem is signalled now and then on this list. People give lots of
tips to help/investigate/fix, but nobody ever came back and told what
he/she did to solve it.  If it ever got solved at all.
It seems it is the index-pipe on the server that is failing with
an (unknown) error.  This triggers a broken index pipe on the client.
We're not sure about that either.  That's why I suggested to
(temporarily) run without index for this DLE, and see if it gets any
better.
If this works fine, they you could try to create the index manually,
and see what's wrong, using commands on the server:
   amrestore -p ... | gtar -tf -  > /tmp/the_index
Then with "gtar" running on the client:
   ssh -l amanda amanda_server amrestore -p ... | gtar -tf - > /tmp/x
If you gzip the resulting index file and put in the correct place, with
the correct name, you have an index for amrecover too.
Also verify the gnutar version (on the client!);  best is 1.13.25 
(1.13.19 is probably ok too, but if you have it, I advice to upgrade 
anyway).  All other versions are suspicious.
Maybe the problem is with gzip (index files are ALWAYS compressed 
--best).  Verify that version too.

--
Paul Bijnens, XplanationTel  +32 16 397.511
Technologielaan 21 bus 2, B-3001 Leuven, BELGIUMFax  +32 16 397.512
http://www.xplanation.com/  email:  [EMAIL PROTECTED]
***
* I think I've got the hang of it now:  exit, ^D, ^C, ^\, ^Z, ^Q, F6, *
* quit,  ZZ, :q, :q!,  M-Z, ^X^C,  logoff, logout, close, bye,  /bye, *
* stop, end, F3, ~., ^]c, +++ ATH, disconnect, halt,  abort,  hangup, *
* PF4, F20, ^X^X, :D::D, KJOB, F14-f-e, F8-e,  kill -1 $$,  shutdown, *
* kill -9 1,  Alt-F4,  Ctrl-Alt-Del,  AltGr-NumLock,  Stop-A,  ...*
* ...  "Are you sure?"  ...   YES   ...   Phew ...   I'm out  *
***



sendbackup: index tee cannot write [Broken pipe], Why?

2003-11-03 Thread John Grover
Amanda Users List:

I've been getting the following error consistently on this client/disk
despite my best efforts to solve it. I've changed the timeout values on
the server, removed extra files from the file system and double checked
permissions and port numbers. The /var filesystem on this same client
backs up fine using the same configuration.

I'm guessing that the timeout is caused by the failure of tar to write
to the broken pipe, but the logs don't give me much to go on. I've
scoured every lat resource I can find and still get these results. I'd
be happy to send logs to anyone who would care to help me out with
this.

Thanks,
John Grover



/-- MyClient.my.d / lev 0 FAILED [data timeout]
sendbackup: start [MyClient.my.domain.name.here:/ level 0]
sendbackup: info BACKUP=/usr/bin/tar
sendbackup: info RECOVER_CMD=/usr/bin/tar -f... -
sendbackup: info end
? sendbackup: index tee cannot write [Broken pipe]
? index returned 1
sendbackup: error [/usr/bin/tar got signal 13]
\

Log excerpt:
...
WARNING planner Last full dump of MyClient.my.domain.name.here:/ on
tape  overwritten in 1 run.
...
FAIL dumper MyClient.my.domain.name.here / 20031101 0 [data timeout]
  sendbackup: start [MyClient.my.domain.name.here:/ level 0]
  sendbackup: info BACKUP=/usr/bin/tar
  sendbackup: info RECOVER_CMD=/usr/bin/tar -f... -
  sendbackup: info end
  ? gtar: Cannot add file
./usr/local/blackboard/docs/lib/session/41394: No such file or
directory
  ? sendbackup: index tee cannot write [Broken pipe]
  ? index returned 1
  sendbackup: error [/usr/bin/tar got signal 13]
SUCCESS dumper MyClient.my.domain.name.here /var 20031101 1 [sec 14.720
kb 2500 kps 169.8 orig-kb 2500]
SUCCESS taper MyClient.my.domain.name.here /var 20031101 1 [sec 2.310
kb 2500 kps 1081.8 {wr: writers 80 rdwait 0.000 wrwait 0.199 filemark
2.107}]
...




Re: backup over slow internet line, Broken pipe?

2003-10-27 Thread Paul Bijnens
Palle Girgensohn wrote:

I get Broken pipe all the time when backing up using tar. The backup 
client is at a different site, so backup is performed over a 2 Mbit/s 
line. While the 2Mb line is not top quality, using netstat -i 1 reveals 
that after a number of hours, almost no backup data is sent at all. 
Finally, it gives up with a "broken pipe" error. Looks to me that the 
tar program fails? Any ideas how get this working better? Can I force 
TCP connections instead of UDP? Anything else I've missed?
It is already using TCP. The UDP packet is only sent from server
to client to initiate the backup and the client replies with
a UDP packet with in it 2 or 3 TCP ports where the server should
connect to: one for the backup, one the error channel and if needed
one for the indexes.

sendbackup: time 0.088: spawning /usr/local/libexec/amanda/runtar in 
pipeline
sendbackup: argument list: gtar --create --file - --directory 
/usr/local/www/server --one-file-system --listed-inc
remental 
/usr/local/var/amanda/gnutar-lists/myclient.mydomain_usr_local_www_server_0
.new --sparse --ignore-failed
-read --totals --files-from 
/tmp/amanda/sendbackup._usr_local_www_server.20031023035208.include
sendbackup-gnutar: time 0.090: /usr/local/libexec/amanda/runtar: pid 11408
sendbackup: time 0.094: started index creator: "/usr/local/bin/gtar -tf 
- 2>/dev/null | sed -e 's/^\.//'"
sendbackup: time 45609.758: index tee cannot write [Broken pipe]
After 45609 seconds (>12 hours), the index stream got broken.


log, serverside:

STRANGE dumper myclient.mydomain /usr/local/www/server 0 [sec 42722.653 
kb 4990910 kps 116.8 orig-kb 4990910]
but 2887 seconds earlier the backup was finished.

Why is the index-stream still open?

There were similar messages on this list about a broken index-tee
but nobody ever mentioned a solution.
If it is really the index only, maybe try it without indexing,
to see how this behaves?  If this works, we have a better idea
where to look.
Also don't forget firewall settings... (something times out after maybe
an hour or so?)
--
Paul @ Home


backup over slow internet line, Broken pipe?

2003-10-27 Thread Palle Girgensohn
Hi!

I get Broken pipe all the time when backing up using tar. The backup client 
is at a different site, so backup is performed over a 2 Mbit/s line. While 
the 2Mb line is not top quality, using netstat -i 1 reveals that after a 
number of hours, almost no backup data is sent at all. Finally, it gives up 
with a "broken pipe" error. Looks to me that the tar program fails? Any 
ideas how get this working better? Can I force TCP connections instead of 
UDP? Anything else I've missed?

Thanks,
Palle
I use freebsd 4.8p3, amanda 2.4.4.

netusage 400 kbps   # maximum net bandwidth for Amanda, in KB per sec

dumpcycle 21 days   # the number of days in the normal dump cycle
runspercycle 12 # number  of  amdump runs in dumpcycle days
tapecycle 18 tapes  # the number of tapes in rotation
bumpsize 20 Mb  # minimum savings (threshold) to bump level 1 -> 2
bumpdays 2  # minimum days at each level
bumpmult 1.5# threshold = bumpsize * (level-1)**bumpmult
reserve 10  # reserve N % of holding disk for non-level-0 dumps
# Remember to switch between the 2 tape device
tapedev "/dev/nrsa1"# The no-rewind tape device to be uesd DLT
tapetype Adic   # DLT
...

define interface iponly {
   comment "iponly in the bergrum"
   use 2 mbps
}

and the disklist:
...
myclient.mydomain / inferior   -1 iponly
myclient.mydomain /usr/local/www/server tar_normal -1 iponly
...
All other backups run smooth, as well as the dump of the root disk.



log, on the client:

sendbackup: debug 1 pid 11405 ruid 10 euid 10: start at Thu Oct 23 03:52:08 
2003
/usr/local/libexec/amanda/sendbackup: version 2.4.4
 parsed request as: program `GNUTAR'
disk `/usr/local/www/server'
device `/usr/local/www/server'
level 0
since 1970:1:1:0:0:0
options 
`|;auth=bsd;index;include-list=.amanda.includes;'
sendbackup: try_socksize: send buffer size is 65536
sendbackup: time 0.000: stream_server: waiting for connection: 0.0.0.0.4531
sendbackup: time 0.000: stream_server: waiting for connection: 0.0.0.0.4532
sendbackup: time 0.000: stream_server: waiting for connection: 0.0.0.0.4533
sendbackup: time 0.000: waiting for connect on 4531, then 4532, then 4533
sendbackup: time 0.013: stream_accept: connection from 195.178.168.86.1382
sendbackup: time 0.020: stream_accept: connection from 195.178.168.86.1383
sendbackup: time 0.034: stream_accept: connection from 195.178.168.86.1384
sendbackup: time 0.034: got all connections
sendbackup-gnutar: time 0.048: doing level 0 dump as listed-incremental to 
/usr/local/var/amanda/gnutar-lists/myclient.mydomain_usr_local_www_server_0
.new
sendbackup-gnutar: time 0.065: doing level 0 dump from date: 1970-01-01 
0:00:00 GMT
sendbackup: time 0.088: spawning /usr/local/libexec/amanda/runtar in 
pipeline
sendbackup: argument list: gtar --create --file - --directory 
/usr/local/www/server --one-file-system --listed-inc
remental 
/usr/local/var/amanda/gnutar-lists/myclient.mydomain_usr_local_www_server_0
.new --sparse --ignore-failed
-read --totals --files-from 
/tmp/amanda/sendbackup._usr_local_www_server.20031023035208.include
sendbackup-gnutar: time 0.090: /usr/local/libexec/amanda/runtar: pid 11408
sendbackup: time 0.094: started index creator: "/usr/local/bin/gtar -tf - 
2>/dev/null | sed -e 's/^\.//'"
sendbackup: time 45609.758: index tee cannot write [Broken pipe]
sendbackup: time 45609.758: pid 11407 finish time Thu Oct 23 16:32:18 2003
sendbackup: time 45609.761: 124: strange(?): sendbackup: index tee cannot 
write [Broken pipe]
sendbackup: time 45609.763: error [/usr/local/bin/gtar got signal 13]
sendbackup: time 45609.763: pid 11405 finish time Thu Oct 23 16:32:18 2003



log, serverside:

STRANGE dumper myclient.mydomain /usr/local/www/server 0 [sec 42722.653 kb 
4990910 kps 116.8 orig-kb 4990910]
 sendbackup: start [myclient.mydomain:/usr/local/www/server level 0]
 sendbackup: info BACKUP=/usr/local/bin/gtar
 sendbackup: info RECOVER_CMD=/usr/local/bin/gtar -f... -
 sendbackup: info end
 ? gtar: ./libfiles/58992: Warning: Cannot stat: (null)
 ? gtar: ./libfiles/58993: Warning: Cannot stat: (null)
 | Total bytes written: 5110691840 (4.8GB, 117kB/s)
 sendbackup: size 4990910
 sendbackup: end



Broken pipe error

2003-10-21 Thread John Grover
I'm getting the following error from one of my client machines:

/-- wiley.my.d / lev 0 FAILED [data timeout]
sendbackup: start [wiley.my.domain:/ level 0]
sendbackup: info BACKUP=/usr/bin/tar
sendbackup: info RECOVER_CMD=/usr/bin/tar -f... -
sendbackup: info end
? sendbackup: index tee cannot write [Broken pipe]
? index returned 1
sendbackup: error [/usr/bin/tar got signal 13]
\

Another machine is configured identically but has no problem... Any
ideas?

Thanks
John Grover

John W Grover, Systems and Database Administrator
  Lake Michigan College
www.lakemichigancollege.edu



Broken pipe

2003-09-18 Thread Pascal Robert
Hi,

all of our machines are being backup except one machine.  This server 
(RH 8) have 3 partitions (/, /home, /var).  I can backup the root 
partition without problems, but the other two are saying this (in the 
/tmp/amanda/ logs):

sendbackup: started index creator: "/bin/tar -tf - 2>/dev/null | sed -e 
's/^\.//'"
index tee cannot write [Broken pipe]
sendbackup: pid 28683 finish time Thu Sep 18 00:18:14 2003
error [/bin/tar got signal 13, compress returned 1]

Any ideas ?



ANSWER: FATAL syncpipe_put: Broken pipe

2003-09-11 Thread Kurt Yoder

Kurt Yoder said:
> What does "taper: FATAL syncpipe_put: Broken pipe" mean? I've tried
> twice to flush a 26 GB dump image to tape, and got this message both
> times.

It was a hardware problem. Something about the motherboard and Linux
didn't get along. Switching to previous motherboard seems to have
fixed it.

-- 
Kurt Yoder
Sport & Health network administrator



Re: FATAL syncpipe_put: Broken pipe

2003-09-11 Thread Eric Siegerman
On Thu, Sep 11, 2003 at 12:09:24PM -0400, Kurt Yoder wrote:
> So what would be
> wrong with the dump image? Should I try manually uncompressing that
> image and seeing what's on it?

I doubt you'd learn anything.  As far as taper is concerned, I
believe the image is just an opaque stream of bytes.  If there's
something in the image's content that's killing taper -- and that
seems pretty farfetched! -- it'll be in the compressed byte
stream as it is in the holding disk.

A problem in the uncompressed image would affect the lower-level
restore tool (restore, gtar, etc.), but as far as Amanda itself
is concerned, the only effect I can imagine would be on index
generation, since that depends on the lower-level restorer.  But
indexing shouldn't come into play during amflush.

--

|  | /\
|-_|/  >   Eric Siegerman, Toronto, Ont.[EMAIL PROTECTED]
|  |  /
When I came back around from the dark side, there in front of me would
be the landing area where the crew was, and the Earth, all in the view
of my window. I couldn't help but think that there in front of me was
all of humanity, except me.
- Michael Collins, Apollo 11 Command Module Pilot



Re: FATAL syncpipe_put: Broken pipe

2003-09-11 Thread Kurt Yoder

Paul Bijnens said:
> Kurt Yoder wrote:
>
>> My amdump log file is quite long, so here's the part that seems
>> most
>> relevant:
>  >  ...
>
> It seems it is the taper writer that is somehow crashing, without
> telling anybody.  Do you find a core file in the amanda home dir
> or in /tmp/amanda ?

Nope

[EMAIL PROTECTED]:/tmp/amanda# find / -name core
/dev/core
/proc/sys/net/core
/old/dev/core


> Just a wild guess: is the tapecapacity < 26Gbyte?  You're still
> running
> version 2.4.2p2 and I remember somewhere someone changed something
> so that a file that fails twice while taping, is not a candidate
> anymore
> (avoiding that one file is tried on each tape and trashes all tapes
> in the loader).

Tape and drive capacity is 40 GB uncompressed. So what would be
wrong with the dump image? Should I try manually uncompressing that
image and seeing what's on it?

> If you're brave, upgrade to 2.4.4p1 (or the latest 2.4.4 snapshot
> from
> http://www.iro.umontreal.ca/~martinea/amanda ) and if the problem
> persists, recompile amanda with TAPER_DEBUG=1.

I'd like to avoid that if at all possible...

-- 
Kurt Yoder
Sport & Health network administrator


Re: FATAL syncpipe_put: Broken pipe

2003-09-11 Thread Kurt Yoder

Paul Bijnens said:
> Kurt Yoder wrote:
>
>> What does "taper: FATAL syncpipe_put: Broken pipe" mean? I've
>> tried
>> twice to flush a 26 GB dump image to tape, and got this message
>> both
>> times.
>
> Any other messages in the file "amdump.*" and the LOG.datestamp.lvl.
> file?
> There are two tapers, a reader and a writer.  Maybe there are some
> error messages from one of those.

My amdump log file is quite long, so here's the part that seems most
relevant:

driver: finished-cmd time 9464.088 dumper0 dumped
borneo.shcorp.com:/home/groups/graphics_photos
driver: send-cmd time 9464.088 to taper: FILE-WRITE 00-00026
/backups/amandadisk/holding/20030911/borneo.shcorp.com._home_groups_graphics__photos.0
borneo.shcorp.com /home/groups/graphics_photos 0 20030911
driver: state time 9464.088 free kps: 36000 space: 55661264 taper:
writing idle-dumpers: 4 qlen tapeq: 0 runq: 0 roomq: 0 wakeup: 86400
driver-idle: not-idle
driver: interface-state time 9464.088 if : free 36000
driver: hdisk-state time 9464.088 hdisk 0: free 55661264 dumpers 0
syncpipe_put: Broken pipe
taper: pid 6574 finish time Thu Sep 11 03:31:12 2003
driver: result time 10870.550 from taper:
dump of driver schedule before start degraded mode:

Then here's my amflush log file:

driver: send-cmd time 0.000 to taper: START-TAPER 20030911
taper: pid 7964 executable taper version 2.4.2p2
changer: opening pipe to: /usr/lib/amanda/chg-multi -info
changer: got exit: 0 str: 1 1 1
changer_query: changer return was 1 1
changer_query: searchable = 0
changer_find: looking for CDS1tape07 changer is searchable = 0
changer: opening pipe to: /usr/lib/amanda/chg-multi -slot current
changer: got exit: 0 str: 1 /dev/nst0
taper: slot 1: date 20030811 label CDS1tape07 (exact label match)
taper: read label `CDS1tape07' date `20030811'
taper: wrote label `CDS1tape07' date `20030911'
driver: send-cmd time 11.544 to taper: FILE-WRITE 00-1
/backups/amandadisk/holding/20030911/borneo.shcorp.com._home_groups_graphics__photos.0
borneo.shcorp.com /home/groups/graphics_photos 0 20030911
syncpipe_put: Broken pipe
taper: pid 7965 finish time Thu Sep 11 09:18:40 2003
driver: send-cmd time 44.205 to taper: QUIT
writing taper command: Broken pipe

and here's my log.20030911.1:

START amflush date 20030911
START taper datestamp 20030911 label CDS1tape07 tape 0
FATAL taper syncpipe_put: Broken pipe
WARNING amflush
/backups/amandadisk/holding/20030911/borneo.shcorp.com._home_groups_graphics__photos.0:
taper error, leaving file on disk
WARNING amflush Could not rmdir
/backups/amandadisk/holding/20030911.  Check for cruft.
FINISH amflush date 20030911 time 44.205


-- 
Kurt Yoder
Sport & Health network administrator


FATAL syncpipe_put: Broken pipe

2003-09-11 Thread Kurt Yoder
What does "taper: FATAL syncpipe_put: Broken pipe" mean? I've tried
twice to flush a 26 GB dump image to tape, and got this message both
times.

-- 
Kurt Yoder
Sport & Health network administrator



AMANDA and sencrypt/sst -> dump: Broken pipe

2003-02-09 Thread marc . bigler
Hello,

I am using an AMANDA 2.4.3 server patched with the sencrypt script from
http://cns.utoronto.ca/~pkern/stuff/amanda-patch/ and also using sst to do
the encryption from the same author. I am using exactly the same
configuration on both the server and the client.

The problem is that somehow dump returns Broken pipe with I turn sencrypt
to on, if I don't use sencrypt it works pefectly. Here is the output of the
sendbackup logfile on the client:

sendbackup: debug 1 pid 21255 ruid 104 euid 104: start at Sun Feb  9
19:44:14 2003
/opt/amanda/libexec/sendbackup: version 2.4.3
  parsed request as: program `DUMP'
 disk `/'
 device `/'
 level 1
 since 2003:2:8:23:14:6
 options `|;auth=bsd;sencrypt;compress-best;'
sendbackup: try_socksize: send buffer size is 65536
sendbackup: time 0.000: stream_server: waiting for connection:
0.0.0.0.44864
sendbackup: time 0.000: stream_server: waiting for connection:
0.0.0.0.44865
sendbackup: time 0.001: waiting for connect on 44864, then 44865
sendbackup: time 0.039: stream_accept: connection from X.X.X.X.57784
sendbackup: time 0.059: stream_accept: connection from X.X.X.X.51050
sendbackup: time 0.060: got all connections
sendbackup: time 0.060: spawning /local/bin/sst in pipeline
sendbackup: argument list: /local/bin/sst -celv
sendbackup: time 0.061: spawning /bin/gzip in pipeline
sendbackup: argument list: /bin/gzip --best
sendbackup-dump: time 0.061: pid 21258: /bin/gzip --best
sendbackup: time 0.063: dumping device '/dev/hda2' with 'ext3'
sendbackup: time 0.064: spawning /sbin/dump in pipeline
sendbackup: argument list: dump 1usf 1048576 - /dev/hda2
sendbackup: time 0.904:  91:  normal(|):   DUMP: Date of this level 1 dump:
Sun Feb  9 19:44:14 2003
sendbackup: time 0.907:  91:  normal(|):   DUMP: Date of last level 0 dump:
Sat Feb  8 23:27:22 2003
sendbackup: time 0.909:  91:  normal(|):   DUMP: Dumping /dev/hda2 (/) to
standard output
sendbackup: time 0.911:  91:  normal(|):   DUMP: Exclude ext3 journal inode
8
sendbackup: time 1.112:  91:  normal(|):   DUMP: Label: /
sendbackup: time 1.115:  91:  normal(|):   DUMP: mapping (Pass I) [regular
files]
sendbackup: time 3.663:  91:  normal(|):   DUMP: mapping (Pass II)
[directories]
sendbackup: time 4.818:  91:  normal(|):   DUMP: estimated 139205 tape
blocks.
sendbackup: time 4.821:  91:  normal(|):   DUMP: Volume 1 started with
block 1 at: Sun Feb  9 19:44:19 2003
sendbackup: time 4.864:  91:  normal(|):   DUMP: dumping (Pass III)
[directories]
sendbackup: time 5.756:  91:  normal(|):   DUMP: Broken pipe
sendbackup: time 5.758:  91:  normal(|):   DUMP: The ENTIRE dump is
aborted.
sendbackup: time 6.059: error [compress got signal 13, sencrypt returned
33, /sbin/dump returned 3]
sendbackup: time 6.059: pid 21255 finish time Sun Feb  9 19:44:20 2003


Does anyone know what could be wrong ? If you need more details or other
log files let me know, I would be pleased to provide them...

Thanks for the help !

Regards




Error 32 (Broken pipe)

2002-02-28 Thread Benjamin Gross

I'm trying to restore a backup of a client from tape.  I'd like to first
read archive file off tape onto amanda server, then move the file onto
the amanda client and run it through restore.

I typed:

amrestore /dev/nst1 samba /dev/hdc1


after a few minutes of restoring I get:

Error 32 (Broken pipe) offset 859930624+32768, wrote 0
amrestore: pipe reader has quit in middle of file
amrestore: skipping ahead to start of next file, please wait...

Can someone please help ?

Regards,
Ben



index tee cannot write [Broken pipe]

2002-02-26 Thread Hikawa
Hi,

I am using amanda version 2.4.2p2.

I am getting error
in /tmp/amanda/sendbackup.20020226160838.debug .

The content of sendbackup.20020226160838.debug is
the following.

sendbackup-gnutar: doing level 0 dump from date: 1970-01-01  0:00:00 GMT
sendbackup: spawning /usr/local/libexec/runtar in pipeline
sendbackup: argument list: gtar --create --file - --directory
/contents --one-file-system --list
ed-incremental
/usr/local/var/amanda/gnutar-lists/aspone_contents_0.new --sparse --ignore-f
ailed
-read --totals .
sendbackup-gnutar: /usr/local/libexec/runtar: pid 28450
sendbackup: started index creator: "/bin/gtar -tf - 2>/dev/null | sed -e
's/^\.//'"
index tee cannot write [Broken pipe]
sendbackup: pid 28448 finish time Tue Feb 26 17:49:17 2002

Backup is fail.
I can not understand this error.

Please help me.

Best regards,
Masafumi Hikawa


Broken Pipe errors

2002-02-25 Thread Matt Galer

I recently added a new machine to my backups, and seem to be having a
problem with a couple of file systems with "index tee cannot write [Broken
pipe]" errors.  

Client is Solaris 7
Server is Solaris 7
amanda version 2.4.2p2
gnutar 1.13.19
firewall between the two servers, but no security restrictions (as in all
other backups work fine).  Server is NAT'ed at firewall

I've run some tests just backing one of those filesystems up, and no matter
how I fiddle with the dumptype (thinking compression in particular might be
part of the problem) I cannot get this error to go away.  In test mode I'm
trying to dump only to holding disk which has 11.5 GB free.  The file system
in question is only ~1.5 GBs.  Larger filesystems on the same host can be
backed up with the same dumptype parameters with no problems.

When I was watching the backup being performed with "amstatus" it showed
data being transferred, and the correct file was being created and updated
on the server.  I think when it stopped updating, the size sent reported by
amstatus was a few KB (32?) over the estimated backup size (didnt write the
data down like a fool) - I realize this may or may not mean a thing, but
just throwing it out there.
 
Here are some (hopefully!) appropriate log files (they dont tell me too
much):

=
log.20020225.0 from server
=
START planner date 20020225
START driver date 20020225
ERROR taper no-tape [no tape online]
FINISH planner date 20020225
STATS driver startup time 2.472
FAIL dumper rnbuyer /export/home 0 [mesg read: Connection timed out]
  sendbackup: start [rnbuyer:/export/home level 0]
  sendbackup: info BACKUP=/usr/local/bin/tar
  sendbackup: info RECOVER_CMD=/usr/local/bin/tar -f... -
  sendbackup: info end
FINISH driver date 20020225 time 7744.120

=
amdump.1 from server
=
amdump: start at Mon Feb 25 10:38:00 EST 2002
planner: pid 5123 executable /usr/local/libexec/planner version 2.4.2p2
planner: build: VERSION="Amanda-2.4.2p2"
planner:BUILT_DATE="Mon Jul 30 12:22:45 EDT 2001"
planner:BUILT_MACH="SunOS utl-atl-05 5.7 Generic_106541-15 sun4u
sparc SUNW,Ultra-4"
planner:CC="gcc"
planner: paths: bindir="/usr/local/bin" sbindir="/usr/local/sbin"
planner:libexecdir="/usr/local/libexec" mandir="/usr/local/man"
planner:AMANDA_TMPDIR="/tmp/amanda" AMANDA_DBGDIR="/tmp/amanda"
planner:CONFIG_DIR="/usr/local/etc/amanda" DEV_PREFIX="/dev/dsk/"
planner:RDEV_PREFIX="/dev/rdsk/" DUMP="/usr/sbin/ufsdump"
planner:RESTORE="/usr/sbin/ufsrestore"
planner:SAMBA_CLIENT="/usr/local/samba/bin/smbclient"
planner:GNUTAR="/usr/local/bin/tar"
planner:COMPRESS_PATH="/usr/local/bin/gzip"
planner:UNCOMPRESS_PATH="/usr/local/bin/gzip"
planner:MAILER="/usr/bin/mailx"
planner:listed_incr_dir="/usr/local/var/amanda/gnutar-lists"
planner: defs:  DEFAULT_SERVER="utl-atl-06" DEFAULT_CONFIG="DailySet1"
planner:DEFAULT_TAPE_SERVER="utl-atl-06"
planner:DEFAULT_TAPE_DEVICE="/dev/null" HAVE_MMAP HAVE_SYSVSHM
planner:LOCKING=POSIX_FCNTL SETPGRP_VOID DEBUG_CODE
planner:AMANDA_DEBUG_DAYS=4 BSD_SECURITY USE_AMANDAHOSTS
planner:CLIENT_LOGIN="siteops" FORCE_USERID HAVE_GZIP
planner:COMPRESS_SUFFIX=".gz" COMPRESS_FAST_OPT="--fast"
planner:COMPRESS_BEST_OPT="--best" UNCOMPRESS_OPT="-dc"
planner: dgram_bind: socket bound to 0.0.0.0.603
READING CONF FILES...
startup took 0.010 secs

SETTING UP FOR ESTIMATES...
setting up estimates for rnbuyer:/export/home
rnbuyer:/export/home overdue 11744 days for level 0
setup_estimate: rnbuyer:/export/home: command 4, options:
last_level -1 next_level0 -11744 level_days 0
getting estimates 0 (0) -1 (-1) -1 (-1)
setting up estimates took 0.000 secs

GETTING ESTIMATES...
driver: pid 5122 executable /usr/local/libexec/driver version 2.4.2p2
driver: send-cmd time 0.005 to taper: START-TAPER 20020225
taper: pid 5124 executable taper version 2.4.2p2
driver: started dumper0 pid 5126
dumper: dgram_bind: socket bound to 0.0.0.0.606
dumper: pid 5126 executable dumper version 2.4.2p2, using port 606
dumper: dgram_bind: socket bound to 0.0.0.0.607
dumper: pid 5127 executable dumper version 2.4.2p2, using port 607
driver: started dumper1 pid 5127
driver: started dumper2 pid 5128
driver: started dumper3 pid 5129
driver: started dumper4 pid 5130
driver: started dumper5 pid 5131
dumper: dgram_bind: socket bound to 0.0.0.0.608
dumper: pid 5128 executable dumper version 2.4.2p2, using port 608
dumper: dgram_bind: socket bound to 0.0.0.0.610
dumper: pid 5130 exec

RE: broken pipe?!

2001-09-19 Thread Chris Noon

> -Original Message-
> From: John R. Jackson [mailto:[EMAIL PROTECTED]]
> Sent: Monday, September 17, 2001 3:30 PM
> To: Chris Noon
> Cc: [EMAIL PROTECTED]
> Subject: Re: broken pipe?!
>
>
> >...  Unfortunately, another dump ran on
> >that client since this error, so sendbackup.debug doesn't refer to this
> >dump.
>
> Sigh.  FYI, if you upgrade that client to 2.4.2p2 (you would not have
> to do everything), the debug files will be kept separate.  Or you could
> rebuild that client using --with-pid-debug-files, but that will require
> you also remember to clean up /tmp/amanda on occasion.
>
> John R. Jackson, Technical Software Specialist, [EMAIL PROTECTED]

I've gotten this error again, and this time I've got the sendbackup.debug
file from the client.  I don't see any glaring problems with it, but here it
is...

**
sendbackup: debug 1 pid 10379 ruid 5095 euid 5095 start time Wed Sep 19
11:29:38 2001
/usr/local/amanda-2.4.2/libexec/sendbackup: got input request: DUMP c1t4d0s7
0 1970:1:1:0:0:0 OPTIONS |;bsd-auth;srvcomp-fast;
index;
  parsed request as: program `DUMP' disk `c1t4d0s7' lev 0 since
1970:1:1:0:0:0 opt `|;bsd-auth;srvcomp-fast;index;'
  waiting for connect on 57695, then 57696, then 57697
  got all connections
sendbackup: started index creator: "/usr/sbin/ufsrestore -tvf - 2>&1 |
sed -e '
s/^leaf[]*[0-9]*[   ]*\.//
t
/^dir[  ]/ {
s/^dir[ ]*[0-9]*[   ]*\.//
s%$%/%
t
}
d
'"
sendbackup: spawning "/usr/sbin/ufsdump" in pipeline
sendbackup: argument list: "dump" "0usf" "1048576" "-" "/dev/rdsk/c1t4d0s7"
sendbackup: index pipe returned 36096
sendbackup: pid 10379 finish time Wed Sep 19 14:05:46 2001

**

Thanks in advance again.

--Chris




Re: broken pipe?!

2001-09-17 Thread John R. Jackson

>...  Unfortunately, another dump ran on
>that client since this error, so sendbackup.debug doesn't refer to this
>dump.

Sigh.  FYI, if you upgrade that client to 2.4.2p2 (you would not have
to do everything), the debug files will be kept separate.  Or you could
rebuild that client using --with-pid-debug-files, but that will require
you also remember to clean up /tmp/amanda on occasion.

John R. Jackson, Technical Software Specialist, [EMAIL PROTECTED]



RE: broken pipe?!

2001-09-17 Thread Chris Noon

Yes it's solaris, using amanda-2.4.1p1.  Unfortunately, another dump ran on
that client since this error, so sendbackup.debug doesn't refer to this
dump.

-Original Message-
From: John R. Jackson [mailto:[EMAIL PROTECTED]]
Sent: Monday, September 17, 2001 1:22 PM
To: Chris Noon
Cc: [EMAIL PROTECTED]
Subject: Re: broken pipe?!


>I keep getting the following error whenever I run a Level 0 on this one
>disk.  ...
>|   DUMP: DUMP IS DONE
>|   DUMP: Level 0 dump on Wed Sep 12 09:37:15 2001
>? Broken Pipe

I take it the client is Solaris?  What version of Amanda?  What's in
/tmp/amanda/sendbackup*debug on the client?

John R. Jackson, Technical Software Specialist, [EMAIL PROTECTED]




Re: broken pipe?!

2001-09-17 Thread John R. Jackson

>I keep getting the following error whenever I run a Level 0 on this one
>disk.  ...
>|   DUMP: DUMP IS DONE
>|   DUMP: Level 0 dump on Wed Sep 12 09:37:15 2001
>? Broken Pipe

I take it the client is Solaris?  What version of Amanda?  What's in
/tmp/amanda/sendbackup*debug on the client?

John R. Jackson, Technical Software Specialist, [EMAIL PROTECTED]



broken pipe?!

2001-09-17 Thread Chris Noon

I keep getting the following error whenever I run a Level 0 on this one
disk.  Everything seems to be fine with the dump, as I am able to
successfully restore from it.  But this error is driving me mad, and I am
getting nowhere on my own.  Any ideas?

*
FAILED AND STRANGE DUMP DETAILS:

/-- kabru  c1t4d0s7 lev 0 STRANGE
sendbackup: start [kabru:c1t4d0s7 level 0]
sendbackup: info BACKUP=/usr/sbin/ufsdump
sendbackup: info RECOVER_CMD=/usr/sbin/ufsrestore -f... -
sendbackup: info end
|   DUMP: Writing 32 Kilobyte records
|   DUMP: Date of this level 0 dump: Wed Sep 12 09:37:15 2001
|   DUMP: Date of last level 0 dump: the epoch
|   DUMP: Dumping /dev/rdsk/c1t4d0s7 (kabru.domain.com:/usr2) to standard
output.
|   DUMP: Mapping (Pass I) [regular files]
|   DUMP: Mapping (Pass II) [directories]
|   DUMP: Estimated 10822366 blocks (5284.36MB) on 0.08 tapes.
|   DUMP: Dumping (Pass III) [directories]
|   DUMP: Dumping (Pass IV) [regular files]
|   DUMP: 7.68% done, finished in 2:00
|   DUMP: 14.26% done, finished in 2:00
|   DUMP: 21.50% done, finished in 1:49
|   DUMP: 27.63% done, finished in 1:45
|   DUMP: 34.33% done, finished in 1:36
|   DUMP: 41.03% done, finished in 1:26
|   DUMP: 47.23% done, finished in 1:18
|   DUMP: 53.78% done, finished in 1:09
|   DUMP: 60.59% done, finished in 0:58
|   DUMP: 66.98% done, finished in 0:49
|   DUMP: 73.76% done, finished in 0:39
|   DUMP: 80.72% done, finished in 0:28
|   DUMP: 87.16% done, finished in 0:19
|   DUMP: 93.48% done, finished in 0:09
|   DUMP: 10816190 blocks (5281.34MB) on 1 volume at 598 KB/sec
|   DUMP: DUMP IS DONE
|   DUMP: Level 0 dump on Wed Sep 12 09:37:15 2001
? Broken Pipe
sendbackup: size 5408095
sendbackup: end
\





Re: amanda client broken pipe

2001-09-10 Thread John R. Jackson

>I'm getting a weird error on one of my amanda clients (2.4.1p1 on
>freebsd 4.2 using gnu tar v 1.13) ...

I hope you don't really mean "1.13" for tar.  If it's not 1.13.19 or later
(from alpha.gnu.org), you're not getting good index files.

>? sendbackup: index tee cannot write [Broken pipe]

This is usually just a symptom of doing a direct to tape backup (i.e.
bypassing the holding disk) and banging into end of tape.  See if that
matches up with the rest of the E-mail report you got.

John R. Jackson, Technical Software Specialist, [EMAIL PROTECTED]



amanda client broken pipe

2001-09-06 Thread Kurt Yoder

Hello

I'm getting a weird error on one of my amanda clients (2.4.1p1 on
freebsd 4.2 using gnu tar v 1.13) that I'm having trouble understanding:

FAILED AND STRANGE DUMP DETAILS:

/-- galadriel. /usr lev 0 FAILED [data timeout]
sendbackup: start [galadriel.shcorp.com:/usr level 0]
sendbackup: info BACKUP=/usr/bin/tar
sendbackup: info RECOVER_CMD=/usr/bin/tar -f... -
sendbackup: info end
? sendbackup: index tee cannot write [Broken pipe]
sendbackup: error [/usr/bin/tar got signal 13, index returned 1]
\

/tmp/amanda/sendbackup.debug says:

sendbackup: debug 1 pid 25453 ruid 1000 euid 1000 start time Thu Sep  6
02:08:29 2001
/usr/local/libexec/amanda/sendbackup: got input request: GNUTAR /usr 0
1970:1:1:0:0:0 OPTIONS
|;bsd-auth;index;exclude-list=/usr/local/lib/amanda/exclude.gtar;
  parsed request as: program `GNUTAR' disk `/usr' lev 0 since
1970:1:1:0:0:0 opt
`|;bsd-auth;index;exclude-list=/usr/local/lib/amanda/exclude.gtar;'
sendbackup: exclude list file "/usr/local/lib/amanda/exclude.gtar" does
not exist, ignoring
  waiting for connect on 2473, then 2474, then 2475
  got all connections
sendbackup: doing level 0 dump as listed-incremental:
/usr/local/var/amanda/gnutar-lists/galadriel.shcorp.com_usr_0.new
sendbackup: doing level 0 dump from date: 1970-01-01  0:00:00 GMT
sendbackup: spawning "/usr/local/libexec/amanda/runtar" in pipeline
sendbackup: argument list: "gtar" "--create" "--directory" "/usr"
"--listed-incremental"
"/usr/local/var/amanda/gnutar-lists/galadriel.shcorp.com_usr_0.new"
"--sparse" "--one-file-system" "--ignore-failed-read" "--totals"
"--file" "-" "."
sendbackup-gnutar: pid 25456: /usr/local/libexec/amanda/runtar --create
--directory /usr --listed-incremental
/usr/local/var/amanda/gnutar-lists/galadriel.shcorp.com_usr_0.new
--sparse --one-file-system --ignore-failed-read --totals --file
- .
sendbackup: started index creator: "/usr/bin/tar -tf - 2>/dev/null | sed
-e 's/^\.//'"
index tee cannot write [Broken pipe]
error [/usr/bin/tar got signal 13, index returned 1]

any ideas on how to fix this? TIA!



Re: "index tee broken pipe" problem again

2001-08-04 Thread Gaby Hornik

On Thu, 2 Aug 2001, John R. Jackson wrote:

> >... I haven't touched something since weeks.
> 
> Yeah, yeah, yeah.  We've all heard that story before :-).

OK., I'm catched. :-))
I touched something on sunday: my firewall. :-(
Seems to be a firewall problem now, packets were dropped
since I installed SP4 and some hotfixes on my Checkpoint
fireall, complaining "unknown established TCP packet".

Does anybody have experience with that? I changed a file
according to phoneboys FAQ, if that solves the problem
remains to be seen on my next backup tomorrow.

Bye, Gaby 
-- 
enlog AGDaimlerstr. 15 - 89079 Ulm
Dipl. Inf. Gaby Hornik  Germany
Systems Administrator   Tel.:  +49 731 2075134
mailto:[EMAIL PROTECTED] Fax:   +49 731 2075109




Re: "index tee broken pipe" problem again

2001-08-04 Thread Gaby Hornik

On Thu, 2 Aug 2001, John R. Jackson wrote:
> >... I haven't touched something since weeks.
> 
> Yeah, yeah, yeah.  We've all heard that story before :-).

He hereally! :-)))

> >Here is, what amanda report says:
> >? sendbackup: index tee cannot write [Broken pipe]
> 
> What else was in the Amanda report, in particular about this disk or
> from taper?

Nothing, except the STRANGE messages coming with backing up
windows shares over Samba.

taper:
first fail on Monday night: no error message
second fail on Tuesday night: nothing, but planner said someting:
  planner: Last full dump of shop:hda4 on tape enlog-std002 overwritten on
  this run.
Wednesday night: nothing about this special disk
last night: nothing but tape is [OK]

> This typically means the backup was going along OK and the tape server
> side shut down for some reason, such as running out of holding disk
> space or running out of tape.  The "index tee cannot write" is just a
> symptom of something else going on.

Hmmm...I have 50 GB on holding disk, backup was about 10 GB
and this special disk has only 500 MB used.

Now, weekly backup will start in half an our and I will watch
whats happening.

Thanks, Gaby
-- 
enlog AGDaimlerstr. 15 - 89079 Ulm
Dipl. Inf. Gaby Hornik  Germany
Systems Administrator   Tel.:  +49 731 2075134
mailto:[EMAIL PROTECTED] Fax:   +49 731 2075109




"index tee broken pipe" problem again

2001-08-04 Thread Gaby Hornik

Hello!

I have this problem since two days. I can't figure
out whats wrong, because I haven't touched something
since weeks.

I also searched Yahoo egroup for this, but I didn't
find a solution.

Here is, what amanda report says:
/-- shop   hda4 lev 0 FAILED [data timeout]
sendbackup: start [shop:hda4 level 0]
sendbackup: info BACKUP=/sbin/dump
sendbackup: info RECOVER_CMD=/sbin/restore -f... -
sendbackup: info end
|   DUMP: Date of this level 0 dump: Thu Aug  2 00:52:01 2001
|   DUMP: Date of last level 0 dump: the epoch
|   DUMP: Dumping /dev/hda4 (/var) to standard output
|   DUMP: Label: none
|   DUMP: mapping (Pass I) [regular files]
|   DUMP: mapping (Pass II) [directories]
|   DUMP: estimated 431729 tape blocks.
|   DUMP: Volume 1 started at: Thu Aug  2 00:53:17 2001
? sendbackup: index tee cannot write [Broken pipe]
|   DUMP: Broken pipe
|   DUMP: The ENTIRE dump is aborted.
? index returned 1
sendbackup: error [/sbin/dump returned 3]
\

Seems that sendbackup has some problems, so I looked
into sendbackup.debug:

sendbackup: debug 1 pid 1628 ruid 37 euid 37 start time Thu Aug  2
00:52:01 2001
/usr/local/libexec/sendbackup: got input request: DUMP hda4 0
1970:1:1:0:0:0 OPTIONS |;bsd-auth;index;
  parsed request as: program `DUMP' disk `hda4' lev 0 since 1970:1:1:0:0:0
opt `|;bsd-auth;index;'
  waiting for connect on 50054, then 50055, then 50056
  got all connections
sendbackup: spawning "/usr/local/libexec/rundump" in pipeline
sendbackup: argument list: "dump" "0usf" "1048576" "-" "/dev/hda4"
sendbackup: started index creator: "/sbin/restore -tvf - 2>&1 | sed -e '
s/^leaf[]*[0-9]*[   ]*\.//
t
/^dir[  ]/ {
s/^dir[ ]*[0-9]*[   ]*\.//
s%$%/%
t
}
d
'"
index tee cannot write [Broken pipe]
error [/sbin/dump returned 3]

Can somebody give me a hint whats wrong?

Thanks, Gaby
-- 
enlog AGDaimlerstr. 15 - 89079 Ulm
Dipl. Inf. Gaby Hornik  Germany
Systems Administrator   Tel.:  +49 731 2075134
mailto:[EMAIL PROTECTED] Fax:   +49 731 2075109




Re: "index tee broken pipe" problem again

2001-08-02 Thread John R. Jackson

>... I haven't touched something since weeks.

Yeah, yeah, yeah.  We've all heard that story before :-).

>Here is, what amanda report says:
>? sendbackup: index tee cannot write [Broken pipe]

What else was in the Amanda report, in particular about this disk or
from taper?

This typically means the backup was going along OK and the tape server
side shut down for some reason, such as running out of holding disk
space or running out of tape.  The "index tee cannot write" is just a
symptom of something else going on.

>Gaby

John R. Jackson, Technical Software Specialist, [EMAIL PROTECTED]



Re: Broken pipe errors

2001-07-02 Thread Olivier Nicole

>A DDS-3 tape should hold 12GB, compressed, right?  It looks like "taper"
>dies after 10GB.  Does the TAPE-ERROR/short write mean the tape is bad?  If
>the tape was more full, I wouldn't be suspicious; but it seems like I
>should be getting more space out of 125m.

Amanda consider only the non-compressed size.

Compressed size is a commercial only argument, that has NO reality. 

Do NOT calculate with compressed size.

Olivier



Re: Broken pipe errors

2001-07-02 Thread John R. Jackson

>> You don't, by any chance, have both hardware and software compression
>> turned on, do you?
>
>How do I tell, exactly?  I have a dumptype line reading
>
>   compress client fast
>
>But that just seems like which CPU it uses to compute the compression
>algorithm.

This line says you are using software compression.  It also happens to
say it's done on the client, but that doesn't really matter.

Since you're using software compression, you **must** turn off hardware
compression.  How you do that is highly OS dependent (it's based on the
tape device name for Solaris and AIX, I think it's an "mt" command for
Linux, and so on).

If you want to use hardware compression, you should turn off software
compression with:

  compress none

... for all the dumptypes.

I wouldn't advise going that route unless you're having trouble (e.g.
excessive load or wallclock time) with software compression.  Software
does a better job (since it can look at longer strings of data) and is
more flexible (you could turn it off for just certain disks whose data
was already compressed, for instance).

>> Which size specification?  If you mean the one from taper, it's very
>> accurate (to the KByte).
>
>Good.

By "the one from taper" I was referring to the "taper: tape /dev/whatever
kb NNN fm NNN ..." line in the "NOTES" section of the E-mail (or in the
amdump.NN file).  All the other Amanda statistics are based on what was
*successfully* done.  So if the image being processed at the time of
the error was large, they could be way off from where the error happened.

>Drew

John R. Jackson, Technical Software Specialist, [EMAIL PROTECTED]



Re: Broken pipe errors

2001-07-02 Thread Drew Raines

* "John R. Jackson" <[EMAIL PROTECTED]>:
>
> I thought DDS-3 was 12 native and 24 compressed.

You're right.  I had my terminology mixed up.  By "12 compressed" I meant
*after* the compression had taken effect.  Anyway...

> You don't, by any chance, have both hardware and software compression
> turned on, do you?

How do I tell, exactly?  I have a dumptype line reading

compress client fast

But that just seems like which CPU it uses to compute the compression
algorithm.

> Double compressing things often makes them expand,
> i.e. you don't even get the native capacity, and might explain what
> you're seeing.

That could explain a lot.  I inherited this system from a previous admin
who had a few gigabytes less data to back-up.  Now, after a few months have
passed, I'm the one stuck with the tapes filling up.  He very well could
have been using both compression types and not have known it.

> Which size specification?  If you mean the one from taper, it's very
> accurate (to the KByte).

Good.

> If you mean the one from the manufacturer, well, let's just say a lot
> of those folks won't ever have an employment problem as long as there
> are cars (or snake oil :-).

We all can't be engineers... :)

-- 
Drew



Re: Broken pipe errors

2001-07-02 Thread John R. Jackson

>A DDS-3 tape should hold 12GB, compressed, right?  ...

I thought DDS-3 was 12 native and 24 compressed.

>It looks like "taper" dies after 10GB.  ...

Correct.

>Does the TAPE-ERROR/short write mean the tape is bad?  ...

It means taper got an error.  That, in turn, might mean any number of
things are bad (tape, drive, cleaning, cables, distance to the nearest
candy machine, etc).

You don't, by any chance, have both hardware and software compression
turned on, do you?  Double compressing things often makes them expand,
i.e. you don't even get the native capacity, and might explain what
you're seeing.

>How accurate is the size specification of the tape?

Which size specification?  If you mean the one from taper, it's very
accurate (to the KByte).

If you mean the one from the manufacturer, well, let's just say a lot
of those folks won't ever have an employment problem as long as there
are cars (or snake oil :-).

>Drew

John R. Jackson, Technical Software Specialist, [EMAIL PROTECTED]



Re: Broken pipe errors

2001-07-02 Thread Bernhard R. Erdmann

> A DDS-3 tape should hold 12GB, compressed, right?  It looks like "taper"
> dies after 10GB.  Does the TAPE-ERROR/short write mean the tape is bad?  If
> the tape was more full, I wouldn't be suspicious; but it seems like I
> should be getting more space out of 125m.

DDS-3 holds 12 GB uncompressed. Period.
Don't count on what marketing people say. blah, 24 GB, blah... They have
no clue of technical details.

It depends on the compressibility of your data how much will fit with
_either_ hardware or software (gzip/bzip2) compression onto a tape.

_Do not_ use both hard and software compression. _It will_ enlarge your
data.

You get something around 20 GB of the usual data mix onto DDS-3 using
either hardware or software compression.

Lots of source code, executables, news or mails extend compressibility,
lots of mp3's or gzipped files reduce it. It's the amount of entropy in
a file being important for reducing it's size.

Do you really expect some algorithm being able to reduce an mp3 in size
with no loss of information which in turn already has only 1/10th of the
original PCM signal _with_ loss of information? - May this loss be
detectable with your ears or not. How do you want to compress _random_
data? What similarities do you expect in random data to be able to write
them in a short term?

I assume you're using hardware compression with gzipped images.
Something between 9.5 to 10 GB is here the usual limit on DDS-3.

Using software compression makes things clearer for Amanda. She knows
how much an image compresses in full and incrementals. This information
will be used in planner stage deciding what to backup in what level. The
usable tape size value can be very accurate.

Hardware compression saves your CPU times, but Amanda doesn't have a
clue about the compressibility of each image. So you use an average
value for tape size hoping for the best.

> How accurate is the size specification of the tape?

When I pushed Amanda to the limit by adding more and more hosts or
filesystems using software compression, I noticed tapetype's measure
being very accurate. Sometimes it didn't fit onto one tape (DDS-2, 4 GB)
because it failed to write 10-50 MB. This is a real good guess! You'll
have to pay lot's of money to get a measuring instrument working below 1
% error.



Broken pipe errors

2001-07-02 Thread Drew Raines

I'm having some seemingly elementary errors that I don't quite understand.

Using Solaris 2.7, Amanda 2.4.1p1, Sun StorEdge DDS-3 changer on an E450.

>From the amdump log:

  driver: dumping phg:/export/home directly to tape
  driver: send-cmd time 11090.642 to taper: PORT-WRITE 00-00043 phg
/export/home 0 20010630
  driver: result time 11090.656 from taper: PORT 41252
  driver: send-cmd time 11090.660 to dumper0: PORT-DUMP 01-00044 41252 phg
/export/home 0 1970:1:1:0:0:0 DUMP |;bsd-auth;compress-fast;index;
  driver: state time 11090.660 free kps: 1357 space: 15843292 taper: writing
idle-dumpers: 3 qlen tape q: 0 runq: 0 stoppedq: 0 wakeup: 86400 driver-idle: 
not-idle
  driver: interface-state time 11090.660 if : free 1 if LE0: free 1
if LOCAL: free 357
  driver: hdisk-state time 11090.660 hdisk 0: free 15843292 dumpers 0
  taper: writing end marker. [phg-weekly-015 ERR kb 10020128 fm 22]
  driver: result time 14749.051 from dumper0: FAILED 01-00044 ["data write:
    Broken pipe"]
  dumper: kill index command
  driver: result time 14749.051 from taper: TAPE-ERROR 00-00043 [writing
file: short write]
  driver: QUITTING time 14749.137 telling children to quit
  driver: send-cmd time 14749.137 to taper: QUIT
Broken Pipe
  amdump: end at Sat Jun 30 05:05:50 CDT 2001

A DDS-3 tape should hold 12GB, compressed, right?  It looks like "taper"
dies after 10GB.  Does the TAPE-ERROR/short write mean the tape is bad?  If
the tape was more full, I wouldn't be suspicious; but it seems like I
should be getting more space out of 125m.

How accurate is the size specification of the tape?

-- 
Drew



Re: Strange Amanda Error: sendbackup: index tee cannot write [Broken pipe]

2001-05-04 Thread Ryan Williams

The problem ended up being the autosensing media. I have never run into a
problem with a media autosense before but I think from now on I will try to
set that every time I setup a network card.

Thanks for the Help,

Ryan Williams



- Original Message -
From: "John R. Jackson" <[EMAIL PROTECTED]>
To: "Ryan Williams" <[EMAIL PROTECTED]>
Cc: <[EMAIL PROTECTED]>
Sent: Tuesday, May 01, 2001 6:59 PM
Subject: Re: Strange Amanda Error: sendbackup: index tee cannot write
[Broken pipe]


> >It is a 100meg card set to autoselect between all of the 100 and 10 base
T.
>
> Uh, huh.  On Solaris, at least, autoselect is death.  No matter what
> you try to do, it will pick wrong and really bad things start to happen.
> We **always** force the issue with an explicit config file entry.
>
> I don't know that's what's happening to you, but it's been a common
> problem.
>
> >The network is a 10baseT. I am not shure if it is full or half duplex.
...
>
> I'm out of my depth here, but I think if the cable is actually 10Mbit,
> duplex does not matter.  Only 100Mbit bring duplex into the picture.
>
> >What do you make of that. I dont see anywhere that it says what it is
> >actually set at.  ...
>
> Which might make sense if it does not apply because you're really only
> doing 10Mbit.
>
> >> It would also be useful to go to the client and look at
sendbackup*debug
> >> in /tmp/amanda, in particular the start and stop time (first and last
> >> lines).
> >
> >/usr/local/libexec/sendbackup: got input request: DUMP ad0s1e 0
> >1970:1:1:0:0:0 OPTIONS |;bsd-auth;srvcomp-fast;index;
> >  parsed request as: program `DUMP' disk `ad0s1e' lev 0 since
1970:1:1:0:0:0
> >opt `|;bsd-auth;srvcomp-fast;index;'
> >  waiting for connect on 2622, then 2623, then 2624
> >/usr/local/libexec/sendbackup: timeout on mesg port 2623
> >/usr/local/libexec/sendbackup: timeout on index port 2624
> >sendbackup: pid 79500 finish time Tue May  1 01:47:00 2001
>
> You cut off the first line, but in any case, this indicates something else
> is going on.  It says sendbackup on the client got tired (30 seconds)
> of waiting on dumper on the server side to make the connections on
> those ports.  The next line after "waiting for connect ..." should have
> been "got all connections".
>
> The sequence of events is:
>
>   dumper connects to the amandad port on the client
>
>   they do some security stuff (UDP packets) then dumper tells it to
>   start sendbackup
>
>   sendbackup starts listening on two or three new ports and sends their
>   numbers (UDP packet) back to dumper on the server
>
>   dumper connects to those ports (TCP) on the client and data starts
>   to flow
>
> So either dumper never got the list of ports, or the connections it
> tried to make didn't work.  It looks like dumper would log an error if
> the connection failed, which implies sendbackup never saw it.
>
> If you upgrade at least the server to 2.4.2p2, you'll get messages
> like this in the amdump.NN file showing that dumper did its part:
>
>   dumper: stream_client: connected to 128.210.10.26.63832
>   dumper: stream_client: our side is 0.0.0.0.63834
>
> Upgrading the client would show similar extra detail on that side.
>
> I don't know why this would be happening.  Any firewalls or other
> protection between the two machines that would not allow these ports to
> go through?  Any help from the FreeBSD folks?
>
> John R. Jackson, Technical Software Specialist, [EMAIL PROTECTED]
>




Re: Strange Amanda Error: sendbackup: index tee cannot write [Broken pipe]

2001-05-01 Thread John R. Jackson

>It is a 100meg card set to autoselect between all of the 100 and 10 base T.

Uh, huh.  On Solaris, at least, autoselect is death.  No matter what
you try to do, it will pick wrong and really bad things start to happen.
We **always** force the issue with an explicit config file entry.

I don't know that's what's happening to you, but it's been a common
problem.

>The network is a 10baseT. I am not shure if it is full or half duplex. ...

I'm out of my depth here, but I think if the cable is actually 10Mbit,
duplex does not matter.  Only 100Mbit bring duplex into the picture.

>What do you make of that. I dont see anywhere that it says what it is
>actually set at.  ...

Which might make sense if it does not apply because you're really only
doing 10Mbit.

>> It would also be useful to go to the client and look at sendbackup*debug
>> in /tmp/amanda, in particular the start and stop time (first and last
>> lines).
>
>/usr/local/libexec/sendbackup: got input request: DUMP ad0s1e 0
>1970:1:1:0:0:0 OPTIONS |;bsd-auth;srvcomp-fast;index;
>  parsed request as: program `DUMP' disk `ad0s1e' lev 0 since 1970:1:1:0:0:0
>opt `|;bsd-auth;srvcomp-fast;index;'
>  waiting for connect on 2622, then 2623, then 2624
>/usr/local/libexec/sendbackup: timeout on mesg port 2623
>/usr/local/libexec/sendbackup: timeout on index port 2624
>sendbackup: pid 79500 finish time Tue May  1 01:47:00 2001

You cut off the first line, but in any case, this indicates something else
is going on.  It says sendbackup on the client got tired (30 seconds)
of waiting on dumper on the server side to make the connections on
those ports.  The next line after "waiting for connect ..." should have
been "got all connections".

The sequence of events is:

  dumper connects to the amandad port on the client

  they do some security stuff (UDP packets) then dumper tells it to
  start sendbackup

  sendbackup starts listening on two or three new ports and sends their
  numbers (UDP packet) back to dumper on the server

  dumper connects to those ports (TCP) on the client and data starts
  to flow

So either dumper never got the list of ports, or the connections it
tried to make didn't work.  It looks like dumper would log an error if
the connection failed, which implies sendbackup never saw it.

If you upgrade at least the server to 2.4.2p2, you'll get messages
like this in the amdump.NN file showing that dumper did its part:

  dumper: stream_client: connected to 128.210.10.26.63832
  dumper: stream_client: our side is 0.0.0.0.63834

Upgrading the client would show similar extra detail on that side.

I don't know why this would be happening.  Any firewalls or other
protection between the two machines that would not allow these ports to
go through?  Any help from the FreeBSD folks?

John R. Jackson, Technical Software Specialist, [EMAIL PROTECTED]



Re: Strange Amanda Error: sendbackup: index tee cannot write [Broken pipe]

2001-05-01 Thread Ryan Williams


- Original Message -
From: "John R. Jackson" <[EMAIL PROTECTED]>
To: "Ryan Williams" <[EMAIL PROTECTED]>
Cc: <[EMAIL PROTECTED]>
Sent: Monday, April 30, 2001 7:20 PM
Subject: Re: Strange Amanda Error: sendbackup: index tee cannot write
[Broken pipe]


> >Below are the two pertinant parts (I think) of an error that is occuring
on
> >an everyday basis on this server.  ...
>
> What version of Amanda?  Same version on client and server?
2.4.2 on both client and server

>
> >... There are 11 other servers that are being backed up
> >just fine but this one just does not want to work.  ...
>
> So what's different about it?  (just kidding :-).
>
> >... Of course it was upgraded because we could not
> >find a network card that worked decently ...
>
> Are we talking 100 Mbit Ethernet?  Any chance you have a duplex problem
> between the client and switch?  Getting that wrong can cause truly
> amazingly bad performance.
>

It is a 100meg card set to autoselect between all of the 100 and 10 base T.
The network is a 10baseT. I am not shure if it is full or half duplex. It
could be a duplex problem. I believe that our backup network can only
support 1/2 duplex.

rl0: flags=8843 mtu 1500
inet 10.0.0.203 netmask 0xff00 broadcast 10.0.0.255
inet6 fe80::250:baff:fe88:c760%rl0 prefixlen 64 scopeid 0x2
ether 00:50:ba:88:c7:60
media: autoselect (none) status: active
supported media: autoselect 100baseTX  100baseTX
10baseT/UTP  10baseT/UTP 100baseTX



What do you make of that. I dont see anywhere that it says what it is
actually set at. Mabey I should try and manually set the mode on the card.

> >  web45.inte ad0s1f lev 0 FAILED [data timeout]
> >  web45.inte ad0s1a lev 0 FAILED [could not connect to web45.internal]
> >  web45.inte ad0s1e lev 0 FAILED [could not connect to web45.internal]
>
> The first one says Amanda waited 30 minutes between getting started or
> getting a block of data and then gave up.  I'm guessing the other two
> failures were because amandad was still running on the client and so
> the new connections were not allowed.
>
> If you're running 2.4.2 or beyond, you could increase the dtimeout
> value in amanda.conf, but half an hour to wait on data is a long time.
> I suspect it means something else is wrong.
>
1/2 an hour should be plenty of time I would think.

> It would also be useful to go to the client and look at sendbackup*debug
> in /tmp/amanda, in particular the start and stop time (first and last
> lines).
>

/usr/local/libexec/sendbackup: got input request: DUMP ad0s1e 0
1970:1:1:0:0:0 OPTIONS |;bsd-auth;srvcomp-fast;index;
  parsed request as: program `DUMP' disk `ad0s1e' lev 0 since 1970:1:1:0:0:0
opt `|;bsd-auth;srvcomp-fast;index;'
  waiting for connect on 2622, then 2623, then 2624
/usr/local/libexec/sendbackup: timeout on mesg port 2623
/usr/local/libexec/sendbackup: timeout on index port 2624
sendbackup: pid 79500 finish time Tue May  1 01:47:00 2001


> The "index tee cannot write [Broken pipe]" stuff is just a symptom of
> the server giving up on the client.  The server shut down the connections
> and the client was still trying to write, so it got a "broken pipe" error.
> The real problem is why the data stream quit moving.
>
> Some other possibilities:
>
>   * An **extremely** busy disk that tar has a hard time getting access
> to.
>
Judging on a ps auxw, It does not appear that the server is doing an awefull
lot on the disk. It should not be doing much as it is mostly just a mail
server for a small number of users.
>   * A busy client and you have compression turned on, so it just cannot
> get enough CPU.
>
It has a big enough cpu to try and do so I would believe. It is a PII 350
and like I said it does not do much.
>   * A very large file system that compresses extremely well so tar and
> gzip are crunching along but not generating any (enough) output.
>
Small filesystem. The whole server is only about 3-4 gigs used.

>   * A broken disk that is very slow to respond (e.g. lots of retries).
>
I dont see any evidence of this on the server.


>   * Some kind of networking problem, software or hardware.  You might
> try some big ftp put's from the client to /dev/null on the server
> and see what happens.
>
Most likely. I was at first under the impression from the output that it was
a software problem. I am going to look further into the hardware aspects of
it now.

> >Ryan Williams
>
> John R. Jackson, Technical Software Specialist, [EMAIL PROTECTED]
>


Thanks




Re: Strange Amanda Error: sendbackup: index tee cannot write [Broken pipe]

2001-04-30 Thread John R. Jackson

>Below are the two pertinant parts (I think) of an error that is occuring on
>an everyday basis on this server.  ...

What version of Amanda?  Same version on client and server?

>... There are 11 other servers that are being backed up
>just fine but this one just does not want to work.  ...

So what's different about it?  (just kidding :-).

>... Of course it was upgraded because we could not
>find a network card that worked decently ...

Are we talking 100 Mbit Ethernet?  Any chance you have a duplex problem
between the client and switch?  Getting that wrong can cause truly
amazingly bad performance.

>  web45.inte ad0s1f lev 0 FAILED [data timeout]
>  web45.inte ad0s1a lev 0 FAILED [could not connect to web45.internal]
>  web45.inte ad0s1e lev 0 FAILED [could not connect to web45.internal]

The first one says Amanda waited 30 minutes between getting started or
getting a block of data and then gave up.  I'm guessing the other two
failures were because amandad was still running on the client and so
the new connections were not allowed.

If you're running 2.4.2 or beyond, you could increase the dtimeout
value in amanda.conf, but half an hour to wait on data is a long time.
I suspect it means something else is wrong.

It would also be useful to go to the client and look at sendbackup*debug
in /tmp/amanda, in particular the start and stop time (first and last
lines).

The "index tee cannot write [Broken pipe]" stuff is just a symptom of
the server giving up on the client.  The server shut down the connections
and the client was still trying to write, so it got a "broken pipe" error.
The real problem is why the data stream quit moving.

Some other possibilities:

  * An **extremely** busy disk that tar has a hard time getting access
to.

  * A busy client and you have compression turned on, so it just cannot
get enough CPU.

  * A very large file system that compresses extremely well so tar and
gzip are crunching along but not generating any (enough) output.

  * A broken disk that is very slow to respond (e.g. lots of retries).

  * Some kind of networking problem, software or hardware.  You might
try some big ftp put's from the client to /dev/null on the server
and see what happens.

>Ryan Williams

John R. Jackson, Technical Software Specialist, [EMAIL PROTECTED]



Strange Amanda Error: sendbackup: index tee cannot write [Broken pipe]

2001-04-30 Thread Ryan Williams

Below are the two pertinant parts (I think) of an error that is occuring on
an everyday basis on this server. Amcheck runs just fine, it just does not
work on this server. There are 11 other servers that are being backed up
just fine but this one just does not want to work. I recently upgraded the
OS from Freebsd 2.2.7 to Freebsd 4.2. This is when the error started
occuring. The upgrade kind of botched itself when it ran but I was able to
recover most of the server. Of course it was upgraded because we could not
find a network card that worked decently to do a backup with amanda so we
did not have a backup of the server. (3c509's work terriably in Freebsd,
especially under a load like amanda, even the fix that we found on one of
the mailing lists did not fix the problem)

Any help would be appreciated. Also if there is any further information that
I could give pertaining to this backup.

FAILURE AND STRANGE DUMP SUMMARY:
  web45.inte ad0s1f lev 0 FAILED [data timeout]
  web45.inte ad0s1a lev 0 FAILED [could not connect to web45.internal]
  web45.inte ad0s1e lev 0 FAILED [could not connect to web45.internal]



/-- web45.inte ad0s1f lev 0 FAILED [data timeout]
sendbackup: start [web45.internal:ad0s1f level 0]
sendbackup: info BACKUP=/usr/bin/tar
sendbackup: info RECOVER_CMD=/usr/bin/tar -f... -
sendbackup: info end
? sendbackup: index tee cannot write [Broken pipe]
? index returned 1
sendbackup: error [/usr/bin/tar got signal 13]
\


Regards,

Ryan Williams




Re: broken pipe

2001-02-15 Thread Juergen Knott

Am Donnerstag, 15. Februar 2001 09:50 schrieben Sie:

> See?  No failures.  You can use amtoc or amadmin info to find out in
> which tape the filesystem was stored.

OK, no failure. Amadmin tell me that the backup is ok.
I thank you for your help and Infos.

Bye Juergen

-- 
Dies ist eine Microsoft freie Mail!



Re: broken pipe

2001-02-15 Thread Alexandre Oliva

On Feb 15, 2001, Juergen  Knott <[EMAIL PROTECTED]> wrote:

> fileserver   sdb10 1319558413195584   --   40:525380.8  40:555375.7
> fileserver   sdc10 1313555213135552   --   45:114844.5  45:134840.9
> fileserver   sdd10 92850889285088   --   30:535009.9  30:555005.7
> king  hdb10 98416969841696   --   43:483745.2  43:483744.4

See?  No failures.  You can use amtoc or amadmin info to find out in
which tape the filesystem was stored.

-- 
Alexandre Oliva   Enjoy Guarana', see http://www.ic.unicamp.br/~oliva/
Red Hat GCC Developer  aoliva@{cygnus.com, redhat.com}
CS PhD student at IC-Unicampoliva@{lsd.ic.unicamp.br, gnu.org}
Free Software Evangelist*Please* write to mailing lists, not to me



Re: broken pipe

2001-02-15 Thread Juergen Knott

Am Donnerstag, 15. Februar 2001 08:51 schrieben Sie:

Hi Alexandre!

> > Also the backup is finnished correctly?
> I'd guess so, but you didn't post the part of the report that says
> which dumps made to which tape, so I can't tell for sure.
OK, i send you the complete mail from amanda and hope that it helps:

Begin complete mail fom Amanda
These dumps were to tapes Daily01, Daily02.
Tonight's dumps should go onto 2 tapes: Daily03, Daily04.

FAILURE AND STRANGE DUMP SUMMARY:
  fileserver sdc1 lev 0 FAILED ["data write: Broken pipe"]
  fileserver sdc1 lev 0 FAILED [dump to tape failed]


STATISTICS:
  Total   Full  Daily
      
Estimate Time (hrs:min)0:10
Run Time (hrs:min) 3:45
Dump Time (hrs:min)2:41   2:41   0:00
Output Size (meg)   44392.544392.50.0
Original Size (meg) 44392.544392.50.0
Avg Compressed Size (%) -- -- -- 
Filesystems Dumped4  4  0
Avg Dump Rate (k/s)  4713.1 4713.1-- 

Tape Time (hrs:min)2:41   2:41   0:00
Tape Size (meg) 44392.644392.60.0
Tape Used (%) 111.0  111.00.0
Filesystems Taped 4  4  0
Avg Tp Write Rate (k/s)  4710.0 4710.0-- 


FAILED AND STRANGE DUMP DETAILS:

/-- fileserver sdc1 lev 0 FAILED ["data write: Broken pipe"]
sendbackup: start [fileserver:sdc1 level 0]
sendbackup: info BACKUP=/sbin/dump
sendbackup: info RECOVER_CMD=/sbin/restore -f... -
sendbackup: info end
|   DUMP: Date of this level 0 dump: Thu Feb 15 04:14:16 2001
|   DUMP: Date of last level 0 dump: the epoch
|   DUMP: Dumping /dev/sdc1 (/test/platte3) to standard output
|   DUMP: Label: none
|   DUMP: mapping (Pass I) [regular files]
|   DUMP: mapping (Pass II) [directories]
|   DUMP: estimated 13086232 tape blocks.
|   DUMP: Volume 1 started at: Thu Feb 15 04:14:58 2001
|   DUMP: dumping (Pass III) [directories]
|   DUMP: dumping (Pass IV) [regular files]
|   DUMP: 10.23% done, finished in 0:43
|   DUMP: 21.75% done, finished in 0:35
|   DUMP: 33.32% done, finished in 0:30
|   DUMP: 44.95% done, finished in 0:24
|   DUMP: 57.16% done, finished in 0:18
|   DUMP: 68.34% done, finished in 0:13
|   DUMP: 80.59% done, finished in 0:08

\


NOTES:
  taper: tape Daily01 kb 44245472 fm 4 writing file: No space left on device
  taper: retrying fileserver:sdc1.0 on new tape: [writing file: No space left 
on device]
  taper: tape Daily02 kb 13135584 fm 1 [OK]


DUMP SUMMARY:
 DUMPER STATSTAPER STATS 
HOSTNAME DISKL ORIG-KB OUT-KB COMP% MMM:SS  KB/s MMM:SS  KB/s
-- - 
fileserver   sdb10 1319558413195584   --   40:525380.8  40:555375.7
fileserver   sdc10 1313555213135552   --   45:114844.5  45:134840.9
fileserver   sdd10 92850889285088   --   30:535009.9  30:555005.7
king  hdb10 98416969841696   --   43:483745.2  43:483744.4

(brought to you by Amanda version 2.4.2-19991216-beta1)

---End of Mail from Amanda

Bye Jeurgen
-- 
Dies ist eine Microsoft freie Mail!



Re: broken pipe

2001-02-14 Thread Alexandre Oliva

On Feb 15, 2001, Juergen  Knott <[EMAIL PROTECTED]> wrote:

> Also the backup is finnished correctly?

I'd guess so, but you didn't post the part of the report that says
which dumps made to which tape, so I can't tell for sure.

-- 
Alexandre Oliva   Enjoy Guarana', see http://www.ic.unicamp.br/~oliva/
Red Hat GCC Developer  aoliva@{cygnus.com, redhat.com}
CS PhD student at IC-Unicampoliva@{lsd.ic.unicamp.br, gnu.org}
Free Software Evangelist*Please* write to mailing lists, not to me



Re: broken pipe

2001-02-14 Thread Juergen Knott

Am Donnerstag, 15. Februar 2001 08:40 schrieben Sie:

> On Feb 15, 2001, Juergen  Knott <[EMAIL PROTECTED]> wrote:
> >   fileserver sdc1 lev 0 FAILED ["data write: Broken pipe"]
> >   fileserver sdc1 lev 0 FAILED [dump to tape failed]
> [snip]
> >   taper: tape Daily01 kb 44245472 fm 4 writing file: No space left on
> > device
> Looks like tape full.

Ok, the tape is full,

> >   taper: tape Daily02 kb 13135584 fm 1 [OK]
> Looks like Amanda recovered correctly, restarting the dump in the
> following tape.

and used the next tape. Also the backup is finnished correctly?

Bye Juergen
-- 
Dies ist eine Microsoft freie Mail!



Re: broken pipe

2001-02-14 Thread Alexandre Oliva

On Feb 15, 2001, Juergen  Knott <[EMAIL PROTECTED]> wrote:

>   fileserver sdc1 lev 0 FAILED ["data write: Broken pipe"]
>   fileserver sdc1 lev 0 FAILED [dump to tape failed]
[snip]
>   taper: tape Daily01 kb 44245472 fm 4 writing file: No space left on device

Looks like tape full.

>   taper: tape Daily02 kb 13135584 fm 1 [OK]

Looks like Amanda recovered correctly, restarting the dump in the
following tape.

-- 
Alexandre Oliva   Enjoy Guarana', see http://www.ic.unicamp.br/~oliva/
Red Hat GCC Developer  aoliva@{cygnus.com, redhat.com}
CS PhD student at IC-Unicampoliva@{lsd.ic.unicamp.br, gnu.org}
Free Software Evangelist*Please* write to mailing lists, not to me



broken pipe

2001-02-14 Thread Juergen Knott

Hi!

At first i installed amanda for a long time on a tape server and amanda works 
fine.
After insatlling amanda on a client i have this problems.

---Begin Amanda Mail

These dumps were to tapes Daily01, Daily02.
Tonight's dumps should go onto 2 tapes: Daily03, Daily04.

FAILURE AND STRANGE DUMP SUMMARY:
  fileserver sdc1 lev 0 FAILED ["data write: Broken pipe"]
  fileserver sdc1 lev 0 FAILED [dump to tape failed]


STATISTICS:
  Total   Full  Daily
      
Estimate Time (hrs:min)0:10
Run Time (hrs:min) 3:45
Dump Time (hrs:min)2:41   2:41   0:00
Output Size (meg)   44392.544392.50.0
Original Size (meg) 44392.544392.50.0
Avg Compressed Size (%) -- -- -- 
Filesystems Dumped4  4  0
Avg Dump Rate (k/s)  4713.1 4713.1-- 

Tape Time (hrs:min)2:41   2:41   0:00
Tape Size (meg) 44392.644392.60.0
Tape Used (%) 111.0  111.00.0
Filesystems Taped 4  4  0
Avg Tp Write Rate (k/s)  4710.0 4710.0-- 


FAILED AND STRANGE DUMP DETAILS:

/-- fileserver sdc1 lev 0 FAILED ["data write: Broken pipe"]
sendbackup: start [fileserver:sdc1 level 0]
sendbackup: info BACKUP=/sbin/dump
sendbackup: info RECOVER_CMD=/sbin/restore -f... -
sendbackup: info end
|   DUMP: Date of this level 0 dump: Thu Feb 15 04:14:16 2001
|   DUMP: Date of last level 0 dump: the epoch
|   DUMP: Dumping /dev/sdc1 (/drive/drive3) to standard output
|   DUMP: Label: none
|   DUMP: mapping (Pass I) [regular files]
|   DUMP: mapping (Pass II) [directories]
|   DUMP: estimated 13086232 tape blocks.
|   DUMP: Volume 1 started at: Thu Feb 15 04:14:58 2001
|   DUMP: dumping (Pass III) [directories]
|   DUMP: dumping (Pass IV) [regular files]
|   DUMP: 10.23% done, finished in 0:43
|   DUMP: 21.75% done, finished in 0:35
|   DUMP: 33.32% done, finished in 0:30
|   DUMP: 44.95% done, finished in 0:24
|   DUMP: 57.16% done, finished in 0:18
|   DUMP: 68.34% done, finished in 0:13
|   DUMP: 80.59% done, finished in 0:0
\

NOTES:
  taper: tape Daily01 kb 44245472 fm 4 writing file: No space left on device
  taper: retrying fileserver:sdc1.0 on new tape: [writing file: No space left 
on device]
  taper: tape Daily02 kb 13135584 fm 1 [OK]

-- End Message from Amanda


What´s the problem of amanda?

Bye Juergen