Hi,

Krsnendu dasa wrote on 21.01.2007 at 13:45:55 [Re: [BackupPC-users] Avoiding 
long backup times]:
> On 20/01/07, Holger Parplies <[EMAIL PROTECTED]> wrote:
> > > I read, that backuppc stores every identical file only one time.
> >
> > Basically true, but BackupPC needs to determine that the file is identical 
> > to
> > something and to what. Sparing you the transfer (if possible with reasonable
> > cost) is rsyncs job. Doing so comes at a cost in terms of CPU usage, so
> > you've got the option of using tar if bandwidth is cheaper than CPU power.
> > In your case, it obviously is not.
> >
> > > [...] What happens with an unchanged file in a full backup?
> >
> > tar and smb will transfer it, rsync will not (presuming you mean
> > "unchanged" as in "same name, same content").
> >
> > For a changed file, rsync will try to speed up the transfer. If you append a
> > few bytes to a large file, tar/smb will transfer the whole file (even on an
> > incremental), while rsync will (basically) transfer some checksums and the
> > few bytes only (on full and incremental backups).
> >
> So...
> 
> With tar and smb the file is transfered then it is checked to see if
> it is the same as a file already in the pool. If it is already in the
> pool a link is created and the copied file is deleted.
> 
> Whereas rsync checks if the file is the same as another in the pool
> before transferring?

No.

Actually, I was thinking of *native rsync* in what I wrote. As I read the
code (or the comments, rather ;-), the same basically applies to
BackupPC/File::RsyncP, though there are two additional challenges worth
mentioning: compression of the local file and pooling.

As Les pointed out, the point of reference is always the same file in the
previous full backup, not any file in the pool that may (later) turn out to
have identical contents. This reference file may be compressed. In any case,
it's part of a previous backup, so the contents cannot be changed "in place".
As I understand it, BackupPC will create a local directory tree containing
the required files (all for a full backup, only the changed ones for an
incremental), much as native rsync would with the '--copy-dest' or
'--compare-dest' options respectively (meaning, an identical file will be
copied locally, not transfered, and a changed file will be partially transfered
if possible), except that BackupPC will already compress the local file if
compression is turned on. Later on, BackupPC_link will replace files in this
directory tree by hardlinks to pools files or add them to the pool as
appropriate.

I like the idea of taking *any* identical file from the pool as reference
though. I don't know if it is possible (i.e. whether the remote rsync will
transfer one checksum covering the whole file before the local part needs to
commit to selecting one file), but it certainly would be a worthwhile speedup.

Regards,
Holger

-------------------------------------------------------------------------
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT & business topics through brief surveys - and earn cash
http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV
_______________________________________________
BackupPC-users mailing list
BackupPC-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/backuppc-users
http://backuppc.sourceforge.net/

Reply via email to