I've been running 3.1.0 since it was created and I finished setting things up a while ago and it ran for months rock solid, but then one day some machines stopped working.

One Gentoo vserver is running 4 of the machines. Gentoo host OS, Ubuntu, Debian and a Gentoo. BackupPC manages to backup the Ubuntu, but not the host OS or the two other vservers.

It also fails on a remote internet server running Gentoo, but not the other server that runs Ubuntu.

The Debian vserver log file looks like this and that seems to be the same error for all of the failing machines:

full backup started for directory /; updating partial #117
Running: /usr/bin/ssh -q -x -l root liferaft vserver debian exec /usr/bin/rsync --server --sender --numeric-ids --perms --owner --group -D --links --hard-links --times --block-size=2048 --recursive --checksum-seed=32761 --ignore-times . /
Xfer PIDs are now 23937
Got remote protocol 30
Negotiated protocol version 28
Checksum caching enabled (checksumSeed = 32761)
Sent exclude: /dev
Sent exclude: /exports
Sent exclude: /home
Sent exclude: /media
Sent exclude: /mnt
Sent exclude: /proc
Sent exclude: /pub
Sent exclude: /srv
Sent exclude: /sys
Sent exclude: /tmp
Sent exclude: /usr/portage
Sent exclude: /var/lock
Sent exclude: /var/run
Sent exclude: /var/tmp
Xfer PIDs are now 23937,23949
  create d 755       0/0        4096 .
  create d 755       0/0        4096 bin
  pool     755       0/0      688492 bin/bash
  pool     755       0/0       25216 bin/bunzip2
  pool     755       0/0       25216 bin/bzcat -> bin/bunzip2
  pool   l 777       0/0           6 bin/bzcmp -> bzdiff
  pool     755       0/0        2128 bin/bzdiff
  pool   l 777       0/0           6 bin/bzegrep -> bzgrep
  pool     755       0/0        4874 bin/bzexe
  pool   l 777       0/0           6 bin/bzfgrep -> bzgrep
  pool     755       0/0        3642 bin/bzgrep
  pool     755       0/0       25216 bin/bzip2 -> bin/bunzip2
  pool     755       0/0        8064 bin/bzip2recover
  pool   l 777       0/0           6 bin/bzless -> bzmore
  pool     755       0/0        1297 bin/bzmore
  pool     755       0/0       26860 bin/cat
  pool     755       0/0       45344 bin/chgrp
  pool     755       0/0       42744 bin/chmod
  pool     755       0/0       47356 bin/chown
  pool     755       0/0       69284 bin/cp
  pool     755       0/0       55052 bin/date
  pool     755       0/0       47852 bin/dd
  pool     755       0/0       45016 bin/df
  pool     755       0/0       92312 bin/dir
  pool     755       0/0        4428 bin/dmesg
  pool     755       0/0        8592 bin/dnsdomainname
  pool     755       0/0       24228 bin/echo
  pool     755       0/0       92436 bin/egrep
  pool     755       0/0       22120 bin/false
  pool     755       0/0       52880 bin/fgrep
  pool     755       0/0      100468 bin/grep
  same     755       0/0          61 bin/gunzip
  same     755       0/0        5864 bin/gzexe
  pool     755       0/0       53420 bin/gzip
  pool     755       0/0        8592 bin/hostname
  pool     755       0/0       12348 bin/kill
Read EOF:
Tried again: got 0 bytes
finish: removing in-process file bin/ln
Child is aborting
Parent read EOF from child: fatal error!
Done: 34 files, 1591016 bytes
Got fatal error during xfer (Child exited prematurely)
Backup aborted (Child exited prematurely)



This one died quick, but it's completely random at what point it fails on the machines. As protocol 30 shows, I've upgraded rsync to 3.0.0 to see if I had a bad rsync 2.6.x (All the dist has upgraded to the same one) that for some reaosn only Ubuntu had fixed, but it still fails.

It sometimes aborts on a singal PIPE on some of the machines and some logs has a can't write to socket error.

The total randomness of what works and whats broken has me completely puzzled.

Attachment: smime.p7s
Description: S/MIME Cryptographic Signature

-------------------------------------------------------------------------
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2008.
http://clk.atdmt.com/MRT/go/vse0120000070mrt/direct/01/
_______________________________________________
BackupPC-users mailing list
[email protected]
List:    https://lists.sourceforge.net/lists/listinfo/backuppc-users
Wiki:    http://backuppc.wiki.sourceforge.net
Project: http://backuppc.sourceforge.net/

Reply via email to