On 2018-02-12 10:37, Ellis H. Wilson III wrote:
> On 02/11/2018 01:24 PM, Hans van Kranenburg wrote:
>> Why not just use `btrfs fi du <subvol> <snap1> <snap2>` now and then
>> and update your administration with the results, instead of putting
>> the burden of keeping track of all administration during every tiny
>> change all day long?

> I will look into that if using built-in group capacity functionality
> proves to be truly untenable.  Thanks!
As a general rule, unless you really need to actively prevent a subvolume from exceeding its quota, this approach will be more reliable and have much less performance impact than using qgroups.
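
For example, a periodic accounting pass could look something like this (a sketch; the paths are hypothetical, and you'd want to match the schedule to your churn rate):

    # Summarize usage for a subvolume and its snapshots in one pass;
    # -s prints one summary line per argument instead of per-file
    # output:
    btrfs filesystem du -s /data/subvol /data/snapshots/subvol.*

The "Exclusive" column is roughly the space that would be freed by deleting just that one subvolume or snapshot, which is often the number people turn to qgroups for.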

>>> CoW is still valuable for us as we're shooting to support on the
>>> order of hundreds of snapshots per subvolume,

>> Hundreds will get you into trouble even without qgroups.

> I should have been more specific.  We are looking to use up to a few
> dozen snapshots per subvolume, but will have many (tens to hundreds
> of) discrete subvolumes (each with up to a few dozen snapshots) in a
> BTRFS filesystem.  If I have it wrong and the scalability issues in
> BTRFS do not solely apply to subvolumes and their snapshot counts,
> please let me know.
The issue isn't so much the total number of snapshots as how many snapshots are sharing data. If each of your individual subvolumes shares no data with any of the others via reflinks (so no deduplication across subvolumes, and no copying files around using reflinks or the clone ioctl), then I would expect things to be just fine without qgroups, provided you're not deleting huge numbers of snapshots at the same time.
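
To make "sharing data via reflinks" concrete, here's a quick illustration (hypothetical paths; cp's --reflink option and the FICLONE ioctl are the usual sources of this kind of sharing):

    # A reflink copy shares extents with the source instead of
    # duplicating data.  If source and destination live in different
    # subvolumes, this creates exactly the cross-subvolume sharing
    # described above:
    cp --reflink=always /data/subvolA/big.db /data/subvolB/big.db

Snapshots create the same kind of extent sharing wholesale, which is why what matters is how many trees reference a given extent, not how many snapshots exist in total.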

With qgroups involved, I really can't say for certain, as I've never done much with them myself, but based on my understanding of how it all works, I would expect multiple subvolumes with a small number of snapshots each to not have as many performance issues as a single subvolume with the same total number of snapshots.
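
If you do end up experimenting with qgroups, the basic workflow is easy to try on a scratch filesystem first (hypothetical mount point; note that enabling quotas triggers a full accounting scan of existing data):

    # Turn on quota accounting for the whole filesystem:
    btrfs quota enable /data
    # List qgroups with their referenced/exclusive byte counts;
    # -p also prints the parent qgroups each one is assigned to:
    btrfs qgroup show -p /data

That lets you measure the overhead on your actual snapshot layout before committing to it in production.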

> I will note you focused on my tiny desktop filesystem when making some
> of your previous comments -- this is why I didn't want to share
> specific details.  Our filesystem will be RAID0 with six large HDDs
> (12TB each).  Reliability concerns do not apply to our situation for
> technical reasons, but if there are capacity scaling issues in BTRFS I
> should be aware of, I'd be glad to hear about them.  I have not seen
> such a limit in the technical documentation, and experiments so far on
> 6x6TB arrays have not shown any performance problems, so I'm inclined
> to believe the only scaling issue is with reflinks.  Correct me if I'm
> wrong.
BTRFS in general works fine at that scale, dependent of course on the level of concurrent access you need to support. Each tree update needs to lock a bunch of things in the tree itself, and having large numbers of clients writing to the same set of files concurrently can cause lock contention issues because of this, especially if all of them are calling fsync() or fdatasync() regularly. These issues can be mitigated by segregating workloads into their own subvolumes (each subvolume is a mostly independent filesystem tree), but it sounds like you're already doing that, so I don't think that would be an issue for you.
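
A minimal sketch of that kind of segregation (hypothetical paths):

    # Each subvolume gets its own file tree, so fsync-heavy clients
    # writing into one subvolume contend far less with writers in
    # another:
    btrfs subvolume create /data/workload-a
    btrfs subvolume create /data/workload-b

The isolation isn't total, since all subvolumes still share the global extent, chunk, and csum trees, but it avoids the file-tree lock contention described above.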

The only other possibility I can think of is that the performance hit from qgroups may scale not just with the number of snapshots of a given subvolume, but also with the total size of the subvolume (more data means more accounting work), though I'm not certain about that (it's just a hunch based on what I do know about qgroups).
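
One cheap way to test that hunch on a scratch filesystem (hypothetical mount point) is to time a full qgroup rescan, since a rescan has to re-account every extent the qgroups cover:

    # Force a complete qgroup accounting pass; -w blocks until done:
    time btrfs quota rescan -w /data

If the rescan time grows roughly with the amount of data rather than with the number of snapshots, that's a decent hint the per-transaction accounting cost does too.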

Now, there are some other odd theoretical cases that may cause issues when dealing with really big filesystems, but they're either really specific edge cases (for example, starting with a really small filesystem and gradually scaling it up in size as it gets full) or happen at scales far larger than what you're talking about (double-digit petabytes at the very least).
