Hello!

Here's what I've done.  I've loaded a brand new backup server just out 
of the box, and a test host running actual RHEL4 on a previously unused 
computer.  In other words, 100% of the hardware (including the network 
cables and switch) is completely different.  I get the same error.

I have included detailed information about the systems involved at the 
end of this e-mail, including the BackupPC and rsync logs.  I am running 
the latest BackupPC, the latest File::RsyncP and a standard Red Hat rsync 
binary.  Yet, I get exactly the same error.

I also decided to retest running rsync directly from the backup server, 
using the command-line specified for BackupPC.  It turns out actual 
rsync *does* fail on the backup server.  Here is the error:

BackupPC side:
# rsync --numeric-ids --perms --owner --group -D --links --hard-links 
--times --block-size=2048 --recursive --one-file-system 
[EMAIL PROTECTED]::ROOT/* .
rsync: read error: No route to host
rsync error: error in rsync protocol data stream (code 12) at io.c(177)
rsync: connection unexpectedly closed (1423795 bytes read so far)
rsync error: error in rsync protocol data stream (code 12) at io.c(165)
#
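
For comparing runs, the byte count can be pulled out of rsync's error 
output.  The following is a minimal Python sketch, not part of the tests 
described here; the function name is made up for illustration.  Note 
that the figure is whatever rsync itself reports as "bytes read so far", 
which may not equal the amount of file data written to disk.

```python
import re
from typing import Optional

# Matches the message rsync prints when the peer goes away mid-transfer.
_CLOSED_RE = re.compile(
    r"connection unexpectedly closed \((\d+) bytes read so far\)"
)


def bytes_read_before_close(rsync_output: str) -> Optional[int]:
    """Return the byte count from rsync's 'connection unexpectedly closed'
    message, or None if that message does not appear in the output."""
    match = _CLOSED_RE.search(rsync_output)
    return int(match.group(1)) if match else None


# The error output quoted above, used as sample input.
sample = """rsync: read error: No route to host
rsync error: error in rsync protocol data stream (code 12) at io.c(177)
rsync: connection unexpectedly closed (1423795 bytes read so far)
rsync error: error in rsync protocol data stream (code 12) at io.c(165)
"""

print(bytes_read_before_close(sample))
```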

Host side:  /var/log/rsyncd.log
2007/01/31 12:16:59 [2625] rsyncd version 2.6.3 starting, listening on 
port 873
2007/01/31 12:17:16 [2627] name lookup failed for 172.28.16.32: Name or 
service not known
2007/01/31 12:17:16 [2627] rsync on . from [EMAIL PROTECTED] (172.28.16.32)
2007/01/31 12:34:15 [2627] rsync: writefd_unbuffered failed to write 
4096 bytes: phase "unknown" [sender]: Connection timed out (110)
2007/01/31 12:34:15 [2627] rsync error: error in rsync protocol data 
stream (code 12) at io.c(909)
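
The timestamps in the rsyncd log give the time between the connection 
being accepted and the write error being logged.  A minimal Python 
sketch of that arithmetic follows (the helper name is hypothetical).  
Keep in mind that the daemon presumably logs the error only once the 
kernel gives up on the connection, so the transfer may have stalled 
some time before 12:34:15.

```python
from datetime import datetime

# rsyncd log lines start with a 19-character "YYYY/MM/DD HH:MM:SS" stamp.
RSYNCD_TIME_FORMAT = "%Y/%m/%d %H:%M:%S"


def rsyncd_timestamp(line: str) -> datetime:
    """Parse the leading timestamp of an rsyncd log line."""
    return datetime.strptime(line[:19], RSYNCD_TIME_FORMAT)


# Two lines taken from the rsyncd.log excerpt above (second one shortened).
connect_line = "2007/01/31 12:17:16 [2627] rsync on . from [EMAIL PROTECTED] (172.28.16.32)"
failure_line = "2007/01/31 12:34:15 [2627] rsync: writefd_unbuffered failed to write"

elapsed = rsyncd_timestamp(failure_line) - rsyncd_timestamp(connect_line)
print(int(elapsed.total_seconds()))  # 1019 seconds, just under 17 minutes
```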

While it still fails, actual rsync seems to copy more data faster than 
File::RsyncP.  In fact, with only 500MB of data, I was actually able to 
get an entire copy done once with the real rsync binary.

To eliminate the possibility of special files causing problems, and to 
generate more test data, I created a test directory and copied /usr into 
it 4 times with cp -a, creating 2.0GB of data (and I later increased it 
to 4GB).  It still fails.
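
For anyone wanting to build a similar data set, here is a rough Python 
equivalent of the repeated copy.  It is only a sketch under stated 
assumptions: the function names and the copy1..copyN layout are made up, 
and shutil.copytree is not a full substitute for cp -a (it does not 
preserve ownership or hard links).

```python
import os
import shutil
from pathlib import Path
from typing import List


def make_test_data(source: Path, dest: Path, copies: int = 4) -> List[Path]:
    """Copy `source` into `dest` `copies` times, as dest/copy1 .. dest/copyN.

    symlinks=True keeps symbolic links as links instead of following them.
    """
    dest.mkdir(parents=True, exist_ok=True)
    created = []
    for i in range(1, copies + 1):
        target = dest / f"copy{i}"
        shutil.copytree(source, target, symlinks=True)
        created.append(target)
    return created


def tree_size(root: Path) -> int:
    """Total size in bytes of the regular files under `root`."""
    total = 0
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            path = os.path.join(dirpath, name)
            if not os.path.islink(path):
                total += os.path.getsize(path)
    return total
```

A call such as make_test_data(Path("/usr"), Path("/data/test")) would 
mirror the procedure above; the destination path is only an example.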

I realized that I've only used my VIA-based backup servers.  Different 
systems *and* different operating systems, but same model of hardware. 
So, I grabbed another computer (IBM NetVista), loaded it with CentOS 3.8 
(just like my backup servers) and ran the above command-line rsync 
against the test data.  Same error.

I then booted Knoppix 3.2 with a 2.4 kernel on the NetVista and did a 
command-line rsync.  Same error!  I then booted Knoppix 3.9 with a 2.6 
kernel:  same error.  Knoppix 4.0.2:  same error.  Knoppix 5.0.1:  Same 
error.

In doing a bunch of tests (which are not all detailed here), I began to 
think that there might be a relationship between system speed and the 
amount of data that could be copied:  the NetVista tended to copy more 
data before the failure than the VIA does (though not always).  So, I 
booted a Knoppix 3.6 CD (3.2 doesn't support the NIC) on my notebook 
(IBM ThinkPad R52, Pentium M 1.86GHz, 1.5GB RAM).  It too failed, but 
much farther along:  1.3GB.  I then booted Knoppix 3.7:  it failed at 
700MB.  I then booted Knoppix 3.9.  It failed once at 550MB.  I reran it, 
and it finished successfully.  So, I doubled the size of the data to 4GB. 
Reran it:  it failed at over 1GB.  It seems that my notebook will still 
fail, but it's a little harder to get it to fail.
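
Since the failure is intermittent, repeating the same transfer several 
times and tallying the results gives a clearer picture than a single 
run.  Below is a minimal Python sketch of such a harness.  It is not 
what was used for the tests above, and the names Trial, run_trials and 
failure_rate are invented for illustration.

```python
import subprocess
import time
from typing import List, NamedTuple, Sequence


class Trial(NamedTuple):
    returncode: int   # process exit status (rsync reported code 12 above)
    seconds: float    # wall-clock duration of the run


def run_trials(command: Sequence[str], count: int) -> List[Trial]:
    """Run `command` `count` times, recording exit status and duration."""
    trials = []
    for _ in range(count):
        start = time.monotonic()
        completed = subprocess.run(
            command,
            stdout=subprocess.DEVNULL,
            stderr=subprocess.DEVNULL,
        )
        trials.append(Trial(completed.returncode, time.monotonic() - start))
    return trials


def failure_rate(trials: Sequence[Trial]) -> float:
    """Fraction of trials that exited with a non-zero status."""
    if not trials:
        return 0.0
    return sum(1 for t in trials if t.returncode != 0) / len(trials)
```

The command passed in would be the rsync command line shown earlier, 
given as a list of arguments.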

So, I am able to reproduce the error with 3 different computers, 2 
completely different operating system families and a bunch of different 
versions.

Now that I'm on a disposable computer, I can start making changes to the 
host side.

Anything else you want to see on the server side?  I'm out of ideas 
there.  I will start manipulating the host side when I get some more 
free time.

Timothy J. Massey




====================================
Detailed Server and Host Information
====================================

Backup Server
=============
Hardware:  VIA EPIA-MII 1.2GHz motherboard, 512MB RAM, -or-
IBM NetVista 6790-CAU (Intel Pentium 4 1.6GHz), 512MB RAM -or-
IBM ThinkPad R52 (Intel Pentium M 1.86GHz), 1.5GB RAM

OS:  Stock CentOS 3.8
   Minimal install with two additional options
     Web Server (all subitems unchecked)
     Windows Server (all subitems unchecked)
Stock Knoppix 3.2
Stock Knoppix 3.6
Stock Knoppix 3.7
Stock Knoppix 3.9
Stock Knoppix 4.0.2
Stock Knoppix 5.0.1

Non-BackupPC software needed by BackupPC:
   CGI.pm-3.25.tar.gz
   compat-libstdc++-7.3-2.96.128.i386.rpm
   Encode-2.18.tar.gz
   par2cmdline-0.4-alt0.M24.1.i586.rpm
   perl-Archive-Zip-1.16-1.1.el3.rf.noarch.rpm
   perl-Compress-Zlib-1.42-1.el3.rf.i386.rpm
   perl-File-RsyncP-0.68-1.el3.rf.i386.rpm
   perl-HTML-Parser-3.26-17.i386.rpm
   perl-HTML-Tagset-3.03-28.noarch.rpm
   perl-libwww-perl-5.65-6.noarch.rpm
   perl-suidperl-5.8.0-94.EL3.i386.rpm
   perl-URI-1.21-7.noarch.rpm
   perl-XML-Parser-2.31-16.EL3.i386.rpm
   perl-XML-RSS-1.10-1.el3.rf.noarch.rpm

BackupPC 3.0.0

Config.pl
   RsyncArgs:
$Conf{RsyncArgs} = [
             #
             # Do not edit these!
             #
             '--numeric-ids',
             '--perms',
             '--owner',
             '--group',
             '-D',
             '--links',
             '--hard-links',
             '--times',
             '--block-size=2048',
             '--recursive',

             #
             # Rsync >= 2.6.3 supports the --checksum-seed option
             # which allows rsync checksum caching on the server.
             # Uncomment this to enable rsync checksum caching if
             # you have a recent client rsync version and you want
             # to enable checksum caching.
             #
             #'--checksum-seed=32761',

             #
             # Add additional arguments here
             #
             '--one-file-system',
];
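
The command-line test earlier in this message uses these same 
arguments, in the same order.  The following Python sketch shows the 
correspondence.  The function name is hypothetical, and the source 
string "USER@HOST::ROOT/*" is a placeholder, since the real user and 
host are redacted in this message.

```python
from typing import List

# The active entries of $Conf{RsyncArgs} above, in order.
RSYNC_ARGS = [
    "--numeric-ids",
    "--perms",
    "--owner",
    "--group",
    "-D",
    "--links",
    "--hard-links",
    "--times",
    "--block-size=2048",
    "--recursive",
    "--one-file-system",
]


def build_rsync_command(source: str, dest: str) -> List[str]:
    """Return the rsync argument list: binary, options, source, destination."""
    return ["rsync", *RSYNC_ARGS, source, dest]


# Placeholder source; substitute the real rsync daemon user and host.
print(" ".join(build_rsync_command("USER@HOST::ROOT/*", ".")))
```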

# uname -a:
Linux vault.example.com 2.4.21-47.EL #1 Tue Aug 1 08:56:24 EDT 2006 i686 
i686 i386 GNU/Linux
# perl -V
Summary of my perl5 (revision 5.0 version 8 subversion 0) configuration:
   Platform:
     osname=linux, osvers=2.4.21-37.0.1.el.centos3.xfs.0smp, 
archname=i386-linux-thread-multi
     uname='linux sillage.bis.pasteur.fr 
2.4.21-37.0.1.el.centos3.xfs.0smp #1 smp fri jan 20 10:58:28 cet 2006 
i686 athlon i386 gnulinux '
     config_args='-des -Doptimize=-O2 -g -pipe -march=i386 -mcpu=i686 
-Dmyhostname=localhost [EMAIL PROTECTED] -Dcc=gcc -Dcf_by=Red 
Hat, Inc. -Dinstallprefix=/usr -Dprefix=/usr -Darchname=i386-linux 
-Dvendorprefix=/usr -Dsiteprefix=/usr 
-Dotherlibdirs=/usr/lib/perl5/5.8.0 -Duseshrplib -Dusethreads 
-Duseithreads -Duselargefiles -Dd_dosuid -Dd_semctl_semun -Di_db 
-Ui_ndbm -Di_gdbm -Di_shadow -Di_syslog -Dman3ext=3pm -Duseperlio 
-Dinstallusrbinperl -Ubincompat5005 -Uversiononly -Dpager=/usr/bin/less 
-isr'
     hint=recommended, useposix=true, d_sigaction=define
     usethreads=define use5005threads=undef useithreads=define 
usemultiplicity=define
     useperlio=define d_sfio=undef uselargefiles=define usesocks=undef
     use64bitint=undef use64bitall=undef uselongdouble=undef
     usemymalloc=n, bincompat5005=undef
   Compiler:
     cc='gcc', ccflags ='-D_REENTRANT -D_GNU_SOURCE -DTHREADS_HAVE_PIDS 
-DDEBUGGING -fno-strict-aliasing -I/usr/local/include 
-D_LARGEFILE_SOURCE -D_FILE_OFFSET_BITS=64 -I/usr/include/gdbm',
     optimize='-O2 -g -pipe -march=i386 -mcpu=i686',
     cppflags='-D_REENTRANT -D_GNU_SOURCE -DTHREADS_HAVE_PIDS 
-DDEBUGGING -fno-strict-aliasing -I/usr/local/include -I/usr/include/gdbm'
     ccversion='', gccversion='3.2.3 20030502 (Red Hat Linux 3.2.3-54)', 
gccosandvers=''
     intsize=4, longsize=4, ptrsize=4, doublesize=8, byteorder=1234
     d_longlong=define, longlongsize=8, d_longdbl=define, longdblsize=12
     ivtype='long', ivsize=4, nvtype='double', nvsize=8, Off_t='off_t', 
lseeksize=8
     alignbytes=4, prototype=define
   Linker and Libraries:
     ld='gcc', ldflags =' -L/usr/local/lib'
     libpth=/usr/local/lib /lib /usr/lib
     libs=-lnsl -lgdbm -ldb -ldl -lm -lpthread -lc -lcrypt -lutil
     perllibs=-lnsl -ldl -lm -lpthread -lc -lcrypt -lutil
     libc=/lib/libc-2.3.2.so, so=so, useshrplib=true, libperl=libperl.so
     gnulibc_version='2.3.2'
   Dynamic Linking:
     dlsrc=dl_dlopen.xs, dlext=so, d_dlsymun=undef, ccdlflags='-rdynamic 
-Wl,-rpath,/usr/lib/perl5/5.8.0/i386-linux-thread-multi/CORE'
     cccdlflags='-fPIC', lddlflags='-shared -L/usr/local/lib'


Characteristics of this binary (from libperl):
   Compile-time options: DEBUGGING MULTIPLICITY USE_ITHREADS 
USE_LARGE_FILES PERL_IMPLICIT_CONTEXT
   Locally applied patches:
         MAINT18379
   Built under linux
   Compiled at Jun 10 2006 13:23:46
   @INC:
     /usr/lib/perl5/5.8.0/i386-linux-thread-multi
     /usr/lib/perl5/5.8.0
     /usr/lib/perl5/site_perl/5.8.0/i386-linux-thread-multi
     /usr/lib/perl5/site_perl/5.8.0
     /usr/lib/perl5/site_perl
     /usr/lib/perl5/vendor_perl/5.8.0/i386-linux-thread-multi
     /usr/lib/perl5/vendor_perl/5.8.0
     /usr/lib/perl5/vendor_perl
     /usr/lib/perl5/5.8.0/i386-linux-thread-multi
     /usr/lib/perl5/5.8.0
     .
#

rsync copy error:
rsync --numeric-ids --perms --owner --group -D --links --hard-links 
--times --block-size=2048 --recursive --one-file-system 
[EMAIL PROTECTED]::ROOT/* .  (wd: /data/junk)
rsync: read error: No route to host
rsync error: error in rsync protocol data stream (code 12) at io.c(177)
rsync: connection unexpectedly closed (1423795 bytes read so far)
rsync error: error in rsync protocol data stream (code 12) at io.c(165)


BackupPC Host log:
2007-01-31 13:24:55 full backup started for directory ROOT
2007-01-31 13:41:53 Got fatal error during xfer (Child exited prematurely)
2007-01-31 13:41:58 Backup aborted (Child exited prematurely)
2007-01-31 13:41:58 Saved partial dump 0

BackupPC System log:
2007-01-31 13:05:20 Reading hosts file
2007-01-31 13:05:20 BackupPC started, pid 2535
2007-01-31 13:05:20 Running BackupPC_trashClean (pid=2541)
2007-01-31 13:05:20 Next wakeup is 2007-01-31 14:00:00
2007-01-31 13:24:54 User backupuser requested backup of victim (victim)
2007-01-31 13:24:55 Started full backup on victim (pid=2583, share=ROOT)
2007-01-31 13:41:58 Backup failed on victim (Child exited prematurely)
2007-01-31 13:41:58 Running BackupPC_link victim (pid=2619)
2007-01-31 13:43:05 Finished victim (BackupPC_link victim)

dmesg is identical to before starting backup.

# tail /data/pc/victim/NewFileList
3586dfe8fe720770cd191f943f3c7e06 74544 
fROOT/fusr/flib/flibopcodes-2.15.92.0.2.so
0abb6364411a0900ec408630590e55c1 87622 fROOT/fusr/flib/flibopcodes.a
2bafac95f3f9744f5c957fe005a41659 13 fROOT/fusr/flib/flibpanel.so
f711c6feebb1b89a3a75180cddd94a32 15 fROOT/fusr/flib/flibpanel.so.5
b6dd3eb54188794681f1ea5b151c6d83 49335 fROOT/fusr/flib/flibpanel.so.5.4
02d893b8ef453af2ac30a56f338ed98e 14 fROOT/fusr/flib/flibpanelw.so
08a20e152294a77cd40a0aa42a959318 16 fROOT/fusr/flib/flibpanelw.so.5
a7b670210fd963da0c79af78c770c82e 51679 fROOT/fusr/flib/flibpanelw.so.5.4
55b529fdab36a57951798940d18df1c2 23 fROOT/fusr/flib/flibparted-1.6.so.12
3fecc98d6ac16527ba5fe188232b777e 287280 fROOT/fusr/flib/flibparte#

# tail /data/pc/victim/XferLog
   create d 755       0/0        4096 usr/lib/debug/usr/lib/tc
   create   755       0/0        9504 
usr/lib/debug/usr/lib/tc/q_netem.so.debug
   create d 755       0/0        4096 usr/lib/debug/usr/sbin
   create   755       0/0       15656 usr/lib/debug/usr/sbin/nstat.debug
   create   755       0/0       21340 usr/lib/debug/usr/sbin/rtacct.debug
   create   755       0/0        7636 usr/lib/debug/usr/sbin/rtstat.debug
   create   755       0/0       77256 usr/lib/debug/usr/sbin/ss.debug
   create d 755       0/0        4096 usr/lib/games
   create d 755       0/0        4096 usr/lib/gcc-lib
   create d 755       0/0       12288 usr/lib/gconv
#

RHEL4 Host
==========
Hardware:  IBM NetVista 6790-CAU
   Intel Pentium 4 1.6GHz, 512MB RAM

OS:  Stock RHEL4
   Minimal install with zero additional options

Non-BackupPC software needed by BackupPC:
   None

# uname -a:
Linux victim.example.com 2.6.9-5.EL #1 Wed Jan 5 19:22:18 EST 2005 i686 
i686 i386 GNU/Linux
# rsync --version
rsync  version 2.6.3  protocol version 28
Copyright (C) 1996-2004 by Andrew Tridgell and others
<http://rsync.samba.org/>
Capabilities: 64-bit files, socketpairs, hard links, symlinks, batchfiles,
               inplace, IPv6, 64-bit system inums, 64-bit internal inums

rsync comes with ABSOLUTELY NO WARRANTY.  This is free software, and you
are welcome to redistribute it under certain conditions.  See the GNU
General Public Licence for details.
#

rsyncd.log:
2007/01/31 12:16:59 [2625] rsyncd version 2.6.3 starting, listening on 
port 873
2007/01/31 12:17:16 [2627] name lookup failed for 172.28.16.32: Name or 
service not known
2007/01/31 12:17:16 [2627] rsync on . from [EMAIL PROTECTED] (172.28.16.32)
2007/01/31 12:34:15 [2627] rsync: writefd_unbuffered failed to write 
4096 bytes: phase "unknown" [sender]: Connection timed out (110)
2007/01/31 12:34:15 [2627] rsync error: error in rsync protocol data 
stream (code 12) at io.c(909)

dmesg is identical to before starting backup.


_______________________________________________
BackupPC-users mailing list
BackupPC-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/backuppc-users
http://backuppc.sourceforge.net/