Re: fsync, rdiff-backup, wapbl, and WD Elements 1T drive

Alan Barrett Fri, 28 Oct 2011 22:55:03 -0700

Matthew Mondor wrote:

Greg Troxel <[email protected]> wrote:
So, I'm inclined to patch rdiff-backup not to fsync, since itseems excessive, and the backup is toast if the machine crashesbefore it is finished -- in that case rdiff-backup just rollsback. Opinions?
I also wonder why fsync would be used for every file, especiallyif you consider a whole run a single "transaction", even more soif using snapshots (although you don't mention using them).

If rdiff-backup was easily able to roll back after a crash, thenI'd probably agree with the above. But it's expensive to rollback (you have to compare the actual data in the files, withoutassuming that {same size, same mtime} implies same data).

The current state of ffs+wabl is that, if the system crashes andthe log is replayed, then files that had been written shortlybefore the crash end up with whatever old data happened to bein the underlying disk blocks, but new metadata indicating thatthe size and timestamps are all up to date. I think that thisviolates traditional unix file system semantics, but the peoplewho worked on wapbl don't seem to think it's a problem.

Anyway, the new metadata with old data tends to make rsync (andprobably rdiff-backup) think that the file is up to date, andso not copy it again next time (unless you perform an expensivecomparison of all the data, nit just the metadata).

I have patched rsync to issue fdatasync(2) calls frequently,to mitigate this problem in my own usage. It does slow itdown, but nowhere near as dramatically as you report. (I useNetBSD-current.)


--apb (Alan Barrett)

Re: fsync, rdiff-backup, wapbl, and WD Elements 1T drive

Reply via email to