Feature requests: online backup - defrag - change RAID level

zedlryqc Sun, 08 Sep 2019 20:32:22 -0700

Hello everyone!

I have been programming for a long time (over 20 years), and I am quite interested in a lot of low-level stuff. In reality, though, I have never done anything related to kernels or filesystems. But I did a lot of assembly, C, OS stuff etc...

Looking at your project status page (at https://btrfs.wiki.kernel.org/index.php/Status), I must say that your priorities don't quite match mine. Of course, opinions usually differ. It is my opinion that there are some quite essential features which btrfs is, unfortunately, still missing.

So here is a list of features which I would rate as very important (for a modern COW filesystem like btrfs), so perhaps you can think about it at least a little bit.

 
1) Full online backup (or copy, whatever you want to call it)
btrfs backup <filesystem name> <partition name> [-f]

- Backs up the btrfs filesystem given by <filesystem name> to the partition <partition name> (with all subvolumes).

- To be performed by creating a new btrfs filesystem in the destination partition <partition name>, with a new GUID.

- All data from the source filesystem <filesystem name> is then copied to the destination partition, similar to how RAID1 works.

- The size of the destination partition must be sufficient to hold the used data from the source filesystem, otherwise the operation fails. The point is that the destination doesn't have to be as large as the source, just sufficient to hold the data (of course, many details and concerns are skipped in this short proposal).

- When the operation completes, the destination partition contains a fully featured, mountable and unmountable btrfs filesystem, which is an exact copy of the source filesystem at some point in time, with all the snapshots and subvolumes of the source filesystem.

- There are two possible implementations of this operation, depending on whether the destination drive is slower than the source drive(s) or not (like, when the destination is an HDD and the source is an SSD). If the source and the destination are of similar speed, then a RAID1-like algorithm can be used (all writes go simultaneously to the source and the destination). This mode can also be used if the user/admin is willing to tolerate a performance hit for some relatively short period of time.

- The second possible implementation is a bit more complex. It can be done by creating a temporary snapshot or by buffering all the current writes until they can be written to the destination drive, but this implementation is of lesser priority (see if you can make the RAID1 implementation work first).
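To make the RAID1-like mode a bit more concrete, here is a toy sketch in Python. It is only a model of the idea, not btrfs code: blocks are plain dictionary entries, the class name MirroredCopy and the dest_capacity parameter are made up for illustration, and deletions, metadata and real I/O are ignored.

```python
class MirroredCopy:
    """Toy model of a RAID1-like online copy (hypothetical, not btrfs code)."""

    def __init__(self, source, dest_capacity):
        # The destination only has to hold the used data, not match the source size.
        if len(source) > dest_capacity:
            raise ValueError("destination too small for the used data")
        self.source = source            # block number -> data (the live filesystem)
        self.dest = {}                  # the copy being built
        self.pending = sorted(source)   # blocks the background copier has not visited yet

    def write(self, block, data):
        # While the copy is running, every write goes to both sides.
        self.source[block] = data
        self.dest[block] = data

    def copy_step(self):
        # Copy one block that is not yet on the destination.
        # Returns False once there is nothing left to copy.
        while self.pending:
            block = self.pending.pop(0)
            if block not in self.dest:  # already mirrored by write(), skip it
                self.dest[block] = self.source[block]
                return True
        return False


if __name__ == "__main__":
    copy = MirroredCopy({0: "a", 1: "b", 2: "c"}, dest_capacity=10)
    copy.copy_step()          # background copier handles block 0
    copy.write(2, "C")        # a user write arrives during the copy
    while copy.copy_step():
        pass
    print(copy.dest == copy.source)
```

The point of the sketch is that the background copier and the mirrored writes never conflict: a block is either copied once by the copier or kept up to date by the mirrored writes, so the destination ends up identical to the source when the copier finishes.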

 
2) Sensible defrag

The defrag is currently a joke. If you use defrag, then you had better not use subvolumes/snapshots. That's... very… hard to tolerate. Quite a necessary feature. I mean, defrag is an operation that should be performed in many circumstances, and in many cases it is even automatically initiated. But btrfs defrag is virtually unusable. And it is unusable where it is most needed, as the presence of subvolumes will, predictably, increase fragmentation by quite a lot.

 
How to do it:

- The extents must not be unshared, but just shuffled a bit. Unsharing the extents is, in most situations, not tolerable.

- The defrag should work by doing a full defrag of one 'selected subvolume' (which can be selected by the user, or it can be guessed, because the user probably wants to defrag the currently mounted subvolume or the default subvolume). The other subvolumes should then share data (shared extents) with the 'selected subvolume' (as much as possible).

- If you want it even more feature-full and complicated, then you could allow the user to specify a list of selected subvolumes, like: subvol1, subvol2, subvol3… etc. The defrag algorithm then defrags subvol1 in full, then subvol2 as much as possible while not changing subvol1 and at the same time sharing extents with subvol1, then subvol3 while not changing subvol1 and subvol2… etc.

- I think it would be wrong to use a general deduplication algorithm for this. Instead, the information about the shared extents should be analyzed given the starting state of the filesystem, and then the algorithm should produce an optimal solution based on the currently shared extents.

 
Deduplication is a different task.
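The prioritized defrag described above can be sketched as a toy model in Python. Everything here is an assumption made for illustration: extents are equal-sized and identified by a name, a subvolume is just a list of extent names in logical order, and the function names defrag_shared and fragments are made up. The sketch only decides where each extent should be placed; it never duplicates an extent, so sharing is kept.

```python
def defrag_shared(subvolumes, priority):
    """Return a new layout (extent name -> physical position).

    Subvolumes listed in `priority` are laid out first, in that order.
    An extent that is already placed is never moved again and never copied,
    so extents shared between subvolumes stay shared.
    """
    order = list(priority) + [n for n in sorted(subvolumes) if n not in priority]
    layout = {}
    next_pos = 0
    for name in order:
        for extent in subvolumes[name]:
            if extent not in layout:
                layout[extent] = next_pos
                next_pos += 1
    return layout


def fragments(extents, layout):
    """Count the contiguous physical runs a subvolume is split into."""
    positions = [layout[e] for e in extents]
    if not positions:
        return 0
    breaks = sum(1 for prev, cur in zip(positions, positions[1:]) if cur != prev + 1)
    return 1 + breaks


if __name__ == "__main__":
    subvolumes = {
        "root": ["a", "b", "c", "d"],
        "snap": ["a", "x", "c", "d"],   # snapshot that differs in one extent
    }
    layout = defrag_shared(subvolumes, priority=["root"])
    for name, extents in subvolumes.items():
        print(name, fragments(extents, layout))
```

In this model the selected subvolume always ends up in a single run, while the other subvolumes are only as contiguous as the shared extents allow. That is the trade-off the proposal accepts in exchange for not unsharing anything.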

3) Downgrade to 'single' or 'DUP' (also, general easy way to switch between RAID levels)

Currently, as far as I gather, the user has to do a "btrfs balance start -dconvert=single -mconvert=single", then delete a drive, which is a rather ridiculous sequence of operations.

Can you do something like "btrfs delete", but such that it also simultaneously converts to 'single', or some other chosen RAID level?
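For reference, the current two-step sequence can be written down as a small Python helper that only builds the commands and does not run them. The helper name convert_and_remove, the mount point /mnt/data and the device /dev/sdb are hypothetical examples; the balance convert filters and "btrfs device delete" are the existing commands.

```python
import shlex


def convert_and_remove(mount_point, device, profile="single"):
    """Build (but do not run) the two commands of the current workflow."""
    return [
        # Step 1: convert data and metadata to the target profile.
        ["btrfs", "balance", "start",
         f"-dconvert={profile}", f"-mconvert={profile}", mount_point],
        # Step 2: remove the device from the filesystem.
        ["btrfs", "device", "delete", device, mount_point],
    ]


if __name__ == "__main__":
    # Hypothetical mount point and device, printed as a dry run.
    for argv in convert_and_remove("/mnt/data", "/dev/sdb"):
        print(shlex.join(argv))
```

The request above amounts to collapsing these two steps into one command that takes the target profile as an option.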

I hope that you will consider my suggestions, and I hope that I'm helpful (although, I guess, the short time I spent working with btrfs and writing this mail cannot compare with the amount of work you are putting into it). Perhaps teams sometimes need a different perspective, an outsider's perspective, in order to better understand the situation.

 
So long!
