Marco Nicolayevsky <ma...@specialtyvalvegroup.com> wrote on 05/14/2014 
08:47:24 PM:

> Hello all,
> 
> I am using a pretty vanilla installation of BackupPC and love the 
> simplicity and fact that it just works as it’s supposed to.
> 
> My problem arises when trying to back up windows clients over rsyncd.
> 
> I have 3 win boxes, all running Win7 and the rsncd server. 
> Functionally, they perform fine. I’m able to backup, restore, etc. 
> My initial tests were with small folders and not the entire drive.
> 
> HOWEVER… each win box has somewhere between 20GB and 3TB of data, 
> and despite being on wired gigabit Ethernet, the 3TB machine ran for
> over 8 days and still wasn’t done before I pulled the plug and 
> called it quits. Backing up 3TB over gig-e should be able to be 
> accomplished in under 1 day, so I’m at a loss of what to do next.

Here are some hard numbers for one of my systems.  All targets 
Windows-based, using rsyncd, no SSH, no encryption, no compression. 
Servers are using 7200RPM SATA drives.

Full size:  1372GB, 1114587 files.  Time to complete:  1175.3 minutes.
(Intel Xeon E5 CPU, 16GB RAM, 6-drive Software RAID-6)

Full size:  627GB, 452759 files.  Time to complete:  682.4 minutes.
(Intel Atom D610 (really slow) CPU, 4GB RAM, 5-drive Software RAID-6)

Now, these are established backups.  I've found that initial backups can 
be 2-3 times as long.  And at 3TB, you're nearly triple of the bigger one. 
 So, that could mean that the initial backup can take 6000 minutes or 
more, or four days.  And personally, I've found that compression can 
double or triple the time again.  Worst-case of all of that could be two 
weeks!  So it is possible that you're seeing correct operation for that 
first backup, and that future backups should be much better.

Also, those are fulls.  Incrementals take 4 hours and 1 hour, 
respectively.

> Is rsync “really” that slow?

No:  rsync is only going to be marginally slower than a non-rsync copy, 
even on the first time, assuming you're not limited by something else (CPU 
or RAM) that would not be a limit for a normal copy.

That could be related to the number of files:  that's an area where rsync 
can get tripped up.  As you can see, I've got >1 million files, so the 
definition of "too many" is pretty big.  But if you had, say, 10M files, 
maybe that's an issue to consider.

> Am I doing something wrong?

*You*?  Who knows?  Is something wrong?  Possibly.

> Is it 
> limited to just the windows client?

No.

> When I was evaluating Bacula, I 
> was able to do the same backup in 1/10th the time (or at least it 
> felt that way since I don’t have hard numbers).

You've got hard numbers above to consider as a comparison and see if they 
will fit your needs.

> Before giving up on BackupPC and considering an alternative, can 
> someone give me some advice?

Yes.  First, you've given us *NOTHING* to go on other than "it's slow". 
It's like telling a mechanic "my car doesn't run right."  Of course, 
you're probably expecting to pay the mechanic, so he's incented to ask 
lots of questions to figure things out.  I think it's pretty telling that 
I can't see a single reply to your request.  While your request includes a 
lot of words, there was almost *NOTHING* of substance in your request 
except a *VERY* brief description of your hardware.  We have almost no 
description of what your backups look like (size and number of files, for 
example), what your backup jobs look like (compression being a very big 
one), or what your backup server is doing (CPU, memory or disk usage). And 
we're not getting paid, so we're not really incented to ask you a lot of 
questions.  But I'll give you a few:

First, is your system limited in some way?  Are you hitting 100% CPU, or 
swapping (not enough RAM), or are your disk overloaded, or something else? 
 Learn to use top and vmstat (or dstat).

Is your infrastructure fundamentally limited in some way?  Have you tried 
doing a straight-up copy from a target to your backup system to make sure 
that the underlying infrastructure is capable of delivering what you 
expect it to?  If you can only get 1-2MB/s copies using SMB, tar, NFS, 
FTP, etc. , then that's all you'll get with BackupPC, too.  But if you can 
get 70MB/s copies between the same two systems some other way, then we can 
expect better of BackupPC.  (But all that does is re-ask the question of 
what is limiting you.)

From my e-mail, you know what is possible and reasonable to get.  If 
you're far away from those results, then you need to figure out what is 
different about your system and causing the slowdown.

The second thing to try is to simplify things.  For me, the first thing I 
do is disable compression.  In today's multi-core universe, compression is 
rapidly becoming a bottleneck again.  The compression algorithms in common 
use today do *not* use multiple cores.  On a system with more than a 
couple of disks I can easily max out one core with compression.

Or, try to use SMB instead of rsyncd.  I would not suggest that as a final 
stop (I really like rsyncd for my systems), but it will help you to see if 
there is a problem with something unique to rsync, rather than 
fundamentally wrong with your system.

Another option would be to expand your testing from a very small section 
to something larger:  say, 100GB.  That is big enough to be somewhat 
representative of the whole, but should be able to complete quickly enough 
even with compression, encryption, etc. to get some baseline numbers to 
work with, including both the *first* and *additional* full backups.  That 
way, you might find that the initial backup will take a week, but each 
additional backup after that will only take 12 hours and you're OK with 
that.  Or, you might find that things are still broken, but now it won't 
cost you a week of your life every time you want to test.

> If it helps, the server is beefy, quad-core i5 proc w/ 4gb ram and 6TB 
raid5.

Eh.  That's pretty baseline for anything you would call a server today, 
and an i5 processor screams client-grade, not server grade.  It's not 
especially beefy, but BackupPC doesn't really require beefy.  See my stats 
on a really crummy embedded-class CPU and software RAID-6!  :)

Tim Massey


 
Out of the Box Solutions, Inc. 
Creative IT Solutions Made Simple!
http://www.OutOfTheBoxSolutions.com
tmas...@obscorp.com 
 
22108 Harper Ave.
St. Clair Shores, MI 48080
Office: (800)750-4OBS (4627)
Cell: (586)945-8796 

------------------------------------------------------------------------------
"Accelerate Dev Cycles with Automated Cross-Browser Testing - For FREE
Instantly run your Selenium tests across 300+ browser/OS combos.
Get unparalleled scalability from the best Selenium testing platform available
Simple to use. Nothing to install. Get started now for free."
http://p.sf.net/sfu/SauceLabs
_______________________________________________
BackupPC-users mailing list
BackupPC-users@lists.sourceforge.net
List:    https://lists.sourceforge.net/lists/listinfo/backuppc-users
Wiki:    http://backuppc.wiki.sourceforge.net
Project: http://backuppc.sourceforge.net/

Reply via email to