On 2017-10-17 13:06, Adam Borowski wrote:
On Tue, Oct 17, 2017 at 08:40:20AM -0400, Austin S. Hemmelgarn wrote:
On 2017-10-17 07:42, Zoltan wrote:
On Tue, Oct 17, 2017 at 1:26 PM, Austin S. Hemmelgarn
<ahferro...@gmail.com> wrote:

I forget sometimes that people insist on storing large volumes of data on
unreliable storage...

In my opinion, the unreliability of the storage is the exact reason for
wanting to use raid1. And I think any problem one encounters with an
unreliable disk can likely happen with more reliable ones as well,
only less frequently; so if I don't feel comfortable using raid1 on an
unreliable medium, then I wouldn't trust it on a more reliable one
either.

The thing is that you need some minimum degree of reliability in the other
components in the storage stack for it to be viable to use any given storage
technology.  If you don't meet that minimum degree of reliability, then you
can't count on the reliability guarantees of the storage technology.

The thing is, reliability guarantees required vary WILDLY depending on your
particular use cases.  On one hand, there's "even a one-minute downtime
would cost us mucho $$$s, can't have that!" -- on the other, "it died?
Okay, we got backups, lemme restore it after the weekend".
Yes, but if you are in the second case, you arguably don't need replication, and would be better served by improving the reliability of your underlying storage stack than by trying to work around its problems. Even in that case, your overall reliability is still constrained by the least reliable component (in more idiomatic terms, 'a chain is only as strong as its weakest link').
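To put that 'weakest link' point in rough numbers (a back-of-the-envelope sketch with made-up figures, not a model of any particular stack): when every component has to work for an operation to complete, the reliabilities multiply,

    R_{total} = \prod_i R_i \le \min_i R_i

so e.g. a 0.999-reliable filesystem on a 0.99-reliable link to a 0.95-reliable disk gives about 0.999 \times 0.99 \times 0.95 \approx 0.94 -- the questionable disk dominates no matter how good everything else is.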

Using replication with a reliable device and a questionable device is essentially the same as trying to add redundancy to a machine by bolting on an extra linkage that doesn't always work and can get in the way of the main linkage it's supposed to be protecting from failure. Yes, it will work most of the time, but the system ends up less reliable than it would be without the 'redundancy'.

Lemme tell you a btrfs blockdev disconnects story.
I have an Odroid-U2, a cheap ARM SoC that, despite being 5 years old and
costing a mere $79 (+$89 eMMC...) still beats the performance of much newer
SoCs that have far better theoretical specs, including subsequent Odroids.
After ~1.5 years of CPU-bound stress tests for one program, I switched this
machine to doing Debian package rebuilds, 24/7/365¼, for QA purposes.
Being a moron, I did not realize until pretty late that high parallelism to
keep all cores utilized is still a net performance loss when a memory-hungry
package goes into a swappeathon, even though the latter is fairly rare.
Thus, I can say disk utilization was pretty much 100%, with almost as much
writing as reading.  The eMMC card endured all of this until very recently
(nowadays it sadly throws errors from time to time).

Thus, I switched the machine to NBD (although it sucks on 100Mbit eth).  Alas,
the network driver allocates memory with GFP_NOIO, which causes NBD
disconnects (somehow this never happens on swap, where GFP_NOIO would be
the obvious case, but on a regular filesystem, where throwing out userspace
memory is safe).  The disconnects happen around once per week.
Somewhat off-topic, but you might try ATAoE as an alternative. In my experience it's more reliable (if you've got a reliable network), gives better performance (there's less protocol overhead than NBD, and it runs on top of layer 2 instead of layer 4), and you can even boot with an ATAoE device as root without needing an initramfs if you have network auto-configuration in the kernel. The generic server-side component is called 'vblade', and you don't need anything on the client side other than the `aoe` kernel module (loading the module scans for devices automatically, and you can easily manage things through the various nodes it creates in /dev).
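For reference, here's roughly what that looks like end to end; this is just a sketch, and the interface name, shelf/slot numbers, exported device and mountpoint below are made-up placeholders:

    # on the box exporting the storage: serve /dev/sdb1 as AoE shelf 0, slot 1, via eth0
    # (vblade runs in the foreground; background it or use your init system)
    vblade 0 1 eth0 /dev/sdb1 &

    # on the client (the Odroid): the aoe module finds exported targets by itself
    modprobe aoe
    aoe-stat                             # from aoetools, optional: list what was discovered
    mount /dev/etherd/e0.1 /mnt/build    # device node is e<shelf>.<slot>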

It's a single-device filesystem, thus disconnects are obviously fatal.  But
they never caused even a single bit of damage (as far as scrub can tell), thus
proving btrfs handles this kind of disconnect well.  Unlike times past, the
kernel doesn't get confused, so no reboot is needed -- merely an unmount,
"service nbd-client restart", mount, and restarting the rebuild jobs.
That's expected behavior though. _Single_ device BTRFS has nothing to get out of sync most of the time; the only window for trouble is when the system dies after writing the first copy of a block in a DUP-profile chunk, and even that is not very likely to cause problems (you'll just lose at most the last <commit-time> worth of data). The moment you add another device though, that simplicity goes out the window.
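If anyone wants to poke at the single-device case being described, a quick sketch (the device and mountpoint are placeholders; -m dup is the default on many single-device setups anyway):

    # one device, with metadata stored twice on it
    mkfs.btrfs -m dup -d single /dev/mmcblk0p2
    mount /dev/mmcblk0p2 /srv/build

    # after an ugly disconnect, verify every copy against its checksum
    btrfs scrub start -B /srv/build
    btrfs scrub status /srv/build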

I can also recreate this filesystem and the build environment on it with
just a few commands, thus, unlike /, there's no need for backups.  But I
haven't needed to recreate it yet.

This is single-device, not RAID5, but it's a good example of a use case
where an unreliable storage medium is acceptable (even if the GFP_NOIO issue
is still worth fixing).
Again, single-device BTRFS handles it just fine (at least, it handles it just as well as any other filesystem does). Multi-device BTRFS is the issue here.
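For the multi-device case, the usual way to check whether the copies diverged after a device dropped out and came back is something like this (mountpoint is a placeholder):

    # per-device counters: read/write/flush errors, corruption and generation mismatches
    btrfs device stats /mnt/array

    # re-read everything, and rewrite stale or corrupt blocks from the good copy
    btrfs scrub start -B /mnt/array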