Hello,

  my perl knowledge is very minimal, so I am unable to use poolCnt files
until someone will describe them as documentation.

  But I used BackupPC_ls to get needed information. Counted only files
("-" in attrs) it's size divided by number of links, which makes me best
results to compare, which backup is largest.

  If someone needs this, my python script live here:
        http://www.salstar.sk/pub/salpack/usr/bin/backuppc_size.py

                                                        SAL

On Wed, Sep 02, 2020 at 11:12:48PM -0700, Craig Barratt wrote:
> As I said in my previous email, I'd recommend using the poolCnt files since
> they accurately reflect what is being stored for each host.  But it will
> take a bit more coding.  You can look in some of the BackupPC utilities
> (eg: BackupPC_refCountUpdate) for examples of how to read the poolCnt files.
> 
> Craig
> 
> On Wed, Sep 2, 2020 at 11:03 PM Ján ONDREJ (SAL) <ondr...@salstar.sk> wrote:
> 
> > Hello,
> >
> >   trying to make a script, which can count backup size, but without success
> > yet. I can parse XferLOG, extract sizes, identify links, but after summary
> > some backups have zero or almost zero size. Looks like this happens, when
> > there is no initial backup (backup with id 0), where all files have been
> > transferred. My XferLOGs contains only changes.
> >
> > Here is what I counted from XferLOG for one server:
> > digitall/XferLOG.625.z: 4.788 MB
> > digitall/XferLOG.628.z: 0.899 MB
> > digitall/XferLOG.629.z: 19.221 MB
> > digitall/XferLOG.623.z: 0.059 MB
> > digitall/XferLOG.627.z: 0.000 MB
> > digitall/XferLOG.622.z: 0.065 MB
> > digitall/XferLOG.624.z: 0.138 MB
> > digitall/XferLOG.626.z: 0.060 MB
> > HOST: digitall: 25.231 MB
> >
> > As you see, all files from XferLOG uses 25 MB of disk space.
> > But I see an 4 GB file stored in my backup, when checking from BackupPC web
> > interface.
> >
> > I still think, that this information about backup File Size is useless.
> > What does it mean?
> > 1. how much files have been transferred: NO
> > 2. how much files are stored on disk: NO
> > 3. how much files are on source filesystem: NO
> >
> > I really don't know, what this means.
> >
> > Please, at least let me know, how I can check, how much files are stored
> > in each backup. I can parse them all using BackupPC_tarCreate or something
> > similar, or using backuppcfs, but this is very slow for large backups.
> >
> >                                                         SAL
> >
> > On Mon, Aug 31, 2020 at 12:20:13PM -0700, Craig Barratt wrote:
> > > I've tried to suggest a couple of reasons that could explain what you are
> > > seeing, based on very incomplete information.
> > >
> > > Without you confirming what the issue actually is, your conclusion is
> > > already that it's "absolutely buggy" and "useless".  To pick another
> > > example, rsync -aHv will also report a total file size that is the sum of
> > > the hardlink file sizes, and it also reports the actual bytes transferred
> > > and the speedup.  So by your logic, does that mean rsync is also
> > > "absolutely buggy" and "useless"?
> > >
> > > I'd recommend you actually understand the issue, and then decide what the
> > > best options are.
> > >
> > > If your question is "is there a reasonable way to potion pool usage among
> > > backup clients?" you are asking a question that doesn't have a simple
> > > answer, because of hardlinks on the clients and pooling among all the
> > > clients.
> > >
> > > That said, it wouldn't be too hard to write a script that reads the
> > > reference counts for a client (which includes all the backups for that
> > > client), and apportioning the pool file sizes to that client based on the
> > > ratio of its own reference count to the total pool reference count for
> > each
> > > pool file.  But that's just one way of doing it.  And commercial
> > > auditing/billing tools are well out of scope for BackupPC, but you are
> > most
> > > welcome to contribute anything you develop.
> > >
> > > Craig
> > >
> > > On Mon, Aug 31, 2020 at 9:38 AM Ján ONDREJ (SAL) <ondr...@salstar.sk>
> > wrote:
> > >
> > > > Hello,
> > > >
> > > >   thanks for explanation, but how I can check in backuppc, which user
> > > > uses how much disk space of my backuppc storage? This way File Size
> > counter
> > > > is absolutelly buggy.
> > > >
> > > >   I need to check, which backup uses most of my space and need to find,
> > > > where I should exclude more files. But there is no information, which
> > > > I can use. New files are only new files, doesn't count how much files
> > > > there are. In Total files hardlinked files are counter multiple times,
> > > > which ends in 10x more space usage in this counter like it's real.
> > > >
> > > >   This way total files counter is useless, only useful for windows
> > users,
> > > > which don't use hardlinks.
> > > >
> > > >                                                         SAL
> > > >
> > > > On Mon, Aug 31, 2020 at 09:09:02AM -0700, Craig Barratt via
> > BackupPC-users
> > > > wrote:
> > > > >  That file is a hardlink, not a symlink. In the backup stats, each
> > > > instance
> > > > > of a hardlink is counted towards the total file size.
> > > > >
> > > > > If your file system has a lot of hardlinks, perhaps that's why the
> > > > reported
> > > > > number is higher than you expect?
> > > > >
> > > > > Craig
> > > > >
> > > > > On Mon, Aug 31, 2020 at 12:40 AM Ján ONDREJ (SAL) <
> > ondr...@salstar.sk>
> > > > > wrote:
> > > > >
> > > > > > Hello,
> > > > > >
> > > > > > On Mon, Aug 31, 2020 at 12:08:50AM -0700, Craig Barratt via
> > > > BackupPC-users
> > > > > > wrote:
> > > > > > > Does your backup include sparse files?
> > > > > >
> > > > > > I think no.
> > > > > >
> > > > > > > Look in the XferLOG file to see the sizes of individual files -
> > it
> > > > > > > shouldn't be too hard to spot one that is large.
> > > > > >
> > > > > > There is no one large file. As I wrote, restored backup is not so
> > large
> > > > > > too.
> > > > > > But you pointed me to right plate. I see this line in XferLOG:
> > > > > >
> > > > > >     new    recv hf..tpog... rw-r--r--     1000,    1000  25089367
> > > > > > var/www/public/media/598522/catalogue.pdf =>
> > > > > > var/www/private/import/docs/catalogue.pdf
> > > > > >
> > > > > > This is a symlink and it's size is counted as 25089367.
> > > > > > According to "=>" symbol, this symlink is properly identified as
> > > > symlink,
> > > > > > but it's size is stored as symlinks target file. This is why backup
> > > > > > size is larger than my filesystem. Can this be fixed?
> > > > > >
> > > > > >                                                         SAL
> > > > > >
> > > > > > > On Sun, Aug 30, 2020 at 11:51 PM Ján ONDREJ (SAL) <
> > > > ondr...@salstar.sk>
> > > > > > > wrote:
> > > > > > >
> > > > > > > > Hello,
> > > > > > > >
> > > > > > > >   I am using BackupPC for years, but after update to v4
> > (4.4.0),
> > > > some
> > > > > > > > backups have inconsistent size dislayed in "File Size/Count
> > Reuse
> > > > > > Summary"
> > > > > > > > table.
> > > > > > > >
> > > > > > > >   This is my servers directory, which should be in backup:
> > > > > > > >
> > > > > > > > Filesystem                 Size  Used Avail Use% Mounted on
> > > > > > > > /dev/md0                   4.0G  3.0G  1.1G  75% /
> > > > > > > > /dev/mapper/vg_server-www  200G  119G   82G  60% /var/www
> > > > > > > >
> > > > > > > > There is no other filesystem mounted, binded to this directory.
> > > > > > > >
> > > > > > > > It's complete size is 200 GB, used only 118 GB. Some files are
> > > > excluded
> > > > > > > > from
> > > > > > > > backup, so an full backup should be about 120 GB uncompressed,
> > > > after
> > > > > > > > compression less. But our File Size table looks like:
> > > > > > > >
> > > > > > > >               Totals            Existing Files             New
> > > > Files
> > > > > > > > Backup# Type  #Files  Size/MiB  MiB/sec  #Files  Size/MiB
> > #Files
> > > > > > Size/MiB
> > > > > > > > 0       full  3220584 943488.6  26.86    2625461 872577.7  1232
> > > >  537.4
> > > > > > > >
> > > > > > > > As you see, this backup total size is 921 GiB. How it's
> > possible,
> > > > that
> > > > > > > > an 200 GB partition is stored as 900 GB?
> > > > > > > > Also according to i-nodes in linux, my server has:
> > > > > > > >
> > > > > > > > Filesystem                   Inodes   IUsed     IFree IUse%
> > > > Mounted on
> > > > > > > > /dev/md0                    4194240   51082   4143158    2% /
> > > > > > > > /dev/mapper/vg_fusion-www 209715200 2470434 207244766    2%
> > > > /var/www
> > > > > > > >
> > > > > > > > So there is 2.5 millions of files, some excluded, but results
> > as
> > > > 3.2
> > > > > > > > millions
> > > > > > > > on backup.
> > > > > > > >
> > > > > > > > Trying to restore files. Restore downloaded a 68GB tar package.
> > > > > > > > This looks to be real, but if there are only 70 GB of data,
> > why on
> > > > > > backuppc
> > > > > > > > status it's displayed as 900 GB?
> > > > > > > >
> > > > > > > > I need to find, which server is using most of my backup space.
> > > > > > > > I know, that it's hard to find, because files are shared
> > between
> > > > > > servers
> > > > > > > > (deduplicated), but at least I can estimate it. But if there
> > are
> > > > > > multiples
> > > > > > > > of usage displayed in statistics, then it's impossible to
> > > > aproximate.
> > > > > > > >
> > > > > > > > I deleted all backups of this server, in hope, that it helps,
> > but
> > > > > > don't.
> > > > > > > > :-(
> > > > > > > >
> > > > > > > > Thank for help.
> > > > > > > >
> > > > > > > >                                                 SAL
> > > > > > > >
> > > > > > > >
> > > > > > > >
> > > > > > > > _______________________________________________
> > > > > > > > BackupPC-users mailing list
> > > > > > > > BackupPC-users@lists.sourceforge.net
> > > > > > > > List:
> > > > https://lists.sourceforge.net/lists/listinfo/backuppc-users
> > > > > > > > Wiki:    https://github.com/backuppc/backuppc/wiki
> > > > > > > > Project: https://backuppc.github.io/backuppc/
> > > > > > > >
> > > > > >
> > > > > >
> > > > > > > _______________________________________________
> > > > > > > BackupPC-users mailing list
> > > > > > > BackupPC-users@lists.sourceforge.net
> > > > > > > List:
> > https://lists.sourceforge.net/lists/listinfo/backuppc-users
> > > > > > > Wiki:    https://github.com/backuppc/backuppc/wiki
> > > > > > > Project: https://backuppc.github.io/backuppc/
> > > > > >
> > > > > >
> > > > > >
> > > > > > _______________________________________________
> > > > > > BackupPC-users mailing list
> > > > > > BackupPC-users@lists.sourceforge.net
> > > > > > List:
> > https://lists.sourceforge.net/lists/listinfo/backuppc-users
> > > > > > Wiki:    https://github.com/backuppc/backuppc/wiki
> > > > > > Project: https://backuppc.github.io/backuppc/
> > > > > >
> > > >
> > > >
> > > > > _______________________________________________
> > > > > BackupPC-users mailing list
> > > > > BackupPC-users@lists.sourceforge.net
> > > > > List:    https://lists.sourceforge.net/lists/listinfo/backuppc-users
> > > > > Wiki:    https://github.com/backuppc/backuppc/wiki
> > > > > Project: https://backuppc.github.io/backuppc/
> > > >
> > > >
> >


_______________________________________________
BackupPC-users mailing list
BackupPC-users@lists.sourceforge.net
List:    https://lists.sourceforge.net/lists/listinfo/backuppc-users
Wiki:    https://github.com/backuppc/backuppc/wiki
Project: https://backuppc.github.io/backuppc/

Reply via email to