Hi,
I am indeed claiming that I think files are not being deleted and I wish I was able to disprove this and identify an alternative reason for the behavior I'm seeing. However, I haven't read of a single way to allow me to do this so far.
My hypothesis is, that it is unlikely that unreferenced files are cleaned up correctly, when several actions that should have restored disk space, such as removing hosts from my backups, one of which had almost 90Gb worth of operating system ISOs and disk images, or reducing the number of kept full and incremental backups, fail to make a noticeable dent in the filesystem usage of my backuppc partition.
I am more than willing to identify (or rule out) specific errors or causes, but I am unfortunately indeed unable to, because I just don't know how or the tools to do so do not exist.
As far as I understand, de-duplication happens at file-level, not byte-level (pooling) [0]. So ISO files that are unique across all hosts should be fairly noticeable when they get added or removed.
Now, I understood that I can find a cpool entry from an attrib_ file within the pc hierarchy, but what would be more interesting to me would be to find a pc entry from a file/md5 hash in the cpool without having to zcat each one and inspect its actual contents.
Is there any way to do that? So far, neither find nor grep have been able to turn up a reference in pc for files in cpool.
It looks like pc has an attrib_ file, the attrib_ file points to a file in the cpool [1]. That file in the cpool is a directory listing including some properties, but its a binary format (perl pack?) and I'm not sure how to proceed from here.
David [0] https://backuppc.github.io/backuppc/BackupPC.html#Storage-layout [1] https://backuppc.github.io/backuppc/BackupPC.html#Attribute-file-format On 24/02/2023 16:16, backu...@kosowsky.org wrote:
Yes - but no one has come remotely close to showing that there is a problem with BackupPC pooling and de-duping itself. Unfortunately, it seems like not a single specific file was ever identified as being either unnecessarily duplicated or mistakenly not deleted from the pool. It's all great to*claim* that you don't think files are being deleted but it's really hard to give any credence to such claim let alone address it if no one is willing and/or able to identify specific errors then can be used to troubleshoot.
-- *TenTwentyFour S.à r.l.* www.tentwentyfour.lu <https://www.tentwentyfour.lu> *T*: +352 20 211 1024 *F*: +352 20 211 1023 1 place de l'Hôtel de Ville 4138 Esch-sur-Alzette
_______________________________________________ BackupPC-users mailing list BackupPC-users@lists.sourceforge.net List: https://lists.sourceforge.net/lists/listinfo/backuppc-users Wiki: https://github.com/backuppc/backuppc/wiki Project: https://backuppc.github.io/backuppc/