Re: Archive integrity/corruption question.

Leland C. Best Sat, 07 Feb 2026 14:05:56 -0800

Hi Patrik, et al,

Thanks for taking the time to expand on Eric's reply.  More below ...


On 2/7/26 5:31 AM, Patrik Dufresne wrote:

Hi Leland,

I want to add a couple of important points to Eric's response:
Regarding the hardware issue, this type of corruption would affect ANY backup software when the underlying hardware has memory errors. It's not specific to rdiff-backup. Any tool would write corrupted data when running on faulty RAM. This is exactly why using ECC memory on backup servers is so critical. Data integrity depends on reliable hardware, and ECC memory can detect and correct these errors before they corrupt your backups on disk.

Yes. Just to be clear, my original post wasn't intended to point to any fault in 'rdiff-backup'. I just wanted to make sure I was correctly understanding where things stood with regard to my backups as a result of my _server's_ problems. And yes, _everything_ on that machine is now suspect.

Obviously, if I had it to do over again I would have gone with ECC RAM. But the system is only for personal use (i.e. not supporting any larger community of users) and ... well ... you know ... money doesn't grow on trees.

Also, your archive is NOT entirely lost. Unlike blob-based backup software (like Borg, Restic, etc.), rdiff-backup stores the current backup as a normal mirror of readable files. This gives you several recovery advantages:
1.

   All uncorrupted files are immediately accessible. You can copy them
   directly using standard file system tools without needing
   rdiff-backup at all. Only the specific corrupted file(s) are
   affected, not the entire archive. But I think you already know that.

Yes. In fact, it appears that under certain circumstances, the archive _can_ be recovered. Specifically, if the current mirror has corrupt files, and if the originals of those files (i.e. the files of which the current mirror is a copy) have not changed, then one can copy the originals over the corrupt files in the current mirror. At least, after that 'rdiff-backup verify ...' says "All files verified successfully". For whatever that's worth.
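
The copy-over repair described above can be sketched in Python. This is only an illustration under stated assumptions: `file_sha1` and `repair_from_original` are hypothetical helper names, not part of rdiff-backup, and the sketch cannot tell whether the original has changed since the last backup, which is the precondition for this repair. It should also be run on a machine with known-good memory.

```python
import hashlib
import shutil
from pathlib import Path


def file_sha1(path: Path, chunk_size: int = 1024 * 1024) -> str:
    """Return the hex SHA1 digest of a file, read in chunks."""
    digest = hashlib.sha1()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def repair_from_original(original: Path, mirror_copy: Path) -> bool:
    """Overwrite mirror_copy with original when their contents differ.

    Assumes (hypothetically) that `original` is unchanged since the
    last backup. Returns True if a copy was made, False otherwise.
    """
    if file_sha1(original) == file_sha1(mirror_copy):
        return False  # contents already match, nothing to repair
    # copy2 also carries over timestamps and permission bits
    shutil.copy2(original, mirror_copy)
    return True
```

After such a copy, running 'rdiff-backup verify ...' again is still the way to confirm that the repository's recorded digests match.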


2.

   You can surgically remove the problematic file using
   |rdiff-backup-delete <corrupted_file_path>| to completely delete the
   history of just that corrupted file from the repository. This should
   eliminate the verification error entirely while preserving
   everything else.

Ha! I did _not_ know about 'rdiff-backup-delete'! Thanks for that! That will definitely be useful. Once I get my server fixed I can continue to use those archives. I just have to remember that they are no longer complete.

So to answer your question directly: No, the entire archive is not lost. You can recover all non-corrupted files immediately, and you have options to repair or remove the corrupted file's history.
That said, Eric's suggestion about creating a new baseline periodically is still excellent practice for long-term backup hygiene, but with good hardware, I can recover files from a 15-year-old backup without problem.

[...]

Yes. I'm going to have to work that into my backup plan ... such as it is. I had been using 'rsync' to copy all my archives to an entirely separate hard drive assuming that hard drive failure was the most likely failure mode ... but, of course, the memory errors have also corrupted that process. Ugh. At least I can easily move that drive to another computer where I can work on them without doing further damage.


Thanks again for all your help.

Cheers
Leland

[...]

On 2/7/26 04:45, Eric L. wrote:
Hi Leland,
everything you write is correct. I would have expected the backup action to detect when something gets corrupt, at time of writing, but that's difficult to reproduce and test, so no guarantee (if you know which file, you could check in past backup logs). But even if it's the case, that doesn't help you anymore.
The only way to address this would be to create a new repository from time to time, to save a new baseline.
KR, Eric

On 04/02/2026 07:48, Leland C. Best wrote:
Hi All,
First, I've used 'rdiff-backup' for a long time (20 years?). I've had to use my backups to recover everything from a few accidentally deleted files to complete system restores to bare metal (although other tools are also needed to do the latter). As such, I want to thank everybody who has contributed, and is contributing, to this outstanding project.
I have a question about the integrity of a backup archive under certain conditions.
As I understand it, the current (i.e. most recent) backup is simply a "mirror" of the source directory. The next most recent backup can then be reconstructed by applying a set of diffs (an "increment"?) to the current backup. Another (additional) set of diffs applied to that would reconstruct the next most recent backup. And so on.
Let's suppose that, somehow, the current backup (the mirror) becomes corrupted. Given how I think things work in 'rdiff-backup', it seems to me that that would mean the _entire_ archive would be corrupted. That is, doing a 'rdiff-backup regress' would _not_ recover the previous backup. Is that correct?
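
The mirror-plus-increments layout described above can be pictured with a toy model in Python. This is a sketch only: the delta format, the function names, and the sample data are assumptions made for illustration and do not reflect rdiff-backup's actual on-disk format or code.

```python
from difflib import SequenceMatcher


def make_reverse_delta(new: bytes, old: bytes) -> list:
    """Describe `old` as copy ranges taken from `new` plus literal bytes."""
    delta = []
    matcher = SequenceMatcher(None, new, old, autojunk=False)
    for tag, i1, i2, j1, j2 in matcher.get_opcodes():
        if tag == "equal":
            delta.append(("copy", i1, i2))        # reuse bytes from the newer file
        elif tag in ("replace", "insert"):
            delta.append(("data", old[j1:j2]))    # bytes found only in the older file
        # "delete": bytes present only in the newer file, nothing to store
    return delta


def apply_delta(base: bytes, delta: list) -> bytes:
    """Rebuild the older version from `base` and a reverse delta."""
    out = bytearray()
    for op in delta:
        if op[0] == "copy":
            out += base[op[1]:op[2]]
        else:
            out += op[1]
    return bytes(out)


def restore(mirror: bytes, increments: list, steps_back: int) -> bytes:
    """Walk back `steps_back` versions, applying the newest increment first."""
    data = mirror
    for delta in increments[:steps_back]:
        data = apply_delta(data, delta)
    return data


# Three hypothetical versions of one file, oldest to newest.
v1 = b"alpha\nbeta\n"
v2 = b"alpha\nbeta\ngamma\n"
v3 = b"alpha\nBETA\ngamma\n"

mirror = v3                                       # the current backup
increments = [make_reverse_delta(v3, v2),         # newest increment first
              make_reverse_delta(v2, v1)]

# Simulate one damaged byte in the mirror and compare the restored file.
corrupted = b"X" + mirror[1:]
print(restore(mirror, increments, 1) == v2)
print(restore(corrupted, increments, 1) == v2)
```

In this toy model, a damaged byte that falls inside a copied range is carried into every older version rebuilt from it, while files whose mirror copy is intact are unaffected.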
I'm asking because my backup server has developed _very_ intermittent memory errors. I only discovered this _because_ an 'rdiff-backup verify ...' on the most recent backup failed. [I ultimately verified it was a memory problem via 'memtest86+'.] The error was of the form
    ERROR:   Computed SHA1 digest of file <some file>
    '4e45b5128111db53558b1135898386bbaac5c4b2' doesn't match recorded
    digest of 'a671cd065bd97e16b6c5a3cf789e37447fa13fa9'. Your backup
    repository may be corrupted!
The point being that, if I'm understanding correctly, then at this point the entire archive is now basically lost. Again, is this correct?
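
The digest comparison behind the error above can be reproduced by hand for a single file. A minimal sketch, assuming Python's standard 'hashlib'; the function names are hypothetical and this is not rdiff-backup's own verification code.

```python
import hashlib


def sha1_of_file(path, chunk_size=1024 * 1024):
    """Compute the hex SHA1 digest of a file without loading it all at once."""
    digest = hashlib.sha1()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_recorded(path, recorded_digest):
    """Return True if the file's computed digest equals the recorded one."""
    return sha1_of_file(path) == recorded_digest.lower()
```

On a machine with faulty RAM the computed digest may itself be unreliable, so a check like this is best run on known-good hardware.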
Thanks in advance for any info.

Cheers
Leland
