Kyle and Garrett: we are not disagreeing. Your responses agree with my
own response.
Garrett D'Amore wrote:
I tend to agree... anyone believing that JBOD concatenation gives a
better sense of reliability probably misunderstands how filesystem
metadata (and potentially even block data for the files themselves) is
scattered around the filesystem, and is setting themselves up for
failure.
That's why I raised the caveats about file fragmentation and filesystem
tables/metadata. These caveats effectively make the risk of using
concatenation similar to the risk of RAID0. Theoretically, there is
still a slightly lower risk to using concatenation because there is a
greater chance that file carving techniques will succeed. Practically,
the caveats will have a noticeable effect, as anyone who has attempted
file carving (even on a single drive) will know well.
Concatenation as a way to "reduce" points of failure is a mistake. If
you want reliability, then don't use RAID0 or concatenation, unless
using mirrors underneath or somesuch.
I would not characterize concatenation as being intended for reducing
failure points, rather it is more a means to easily grow an array, and
to theoretically make salvaging data easier. When speaking of
salvaging, we are talking about minimizing damage after it has
occurred, rather than about reliability, which aims to prevent
nonrecoverable damage from occurring in the first place.
The marginal difference in reliability between concat and RAID0 is
small. It should not be considered as having much, if any, value when
the stored data is otherwise valuable or irreplaceable. In my opinion,
concatenation/JBOD's 'safety' factor over RAID0 is overvalued because
of the caveats pointed out before. The effects of those caveats on
JBOD recovery are underrecognized and underappreciated.
JBOD/concatenation is likely a result of ease of implementation and
easier array expansion compared to RAID0, more so than anything else.
A better intermediary option is as Karl describes: one filesystem per
non-redundant disk, which at least guarantees a compartmentalization of
damage.
More below.
I'll allow that there may be other reasons that concatenation is
preferable to RAID0, but I *suspect* that most people who choose it
are mistaken about filesystem optimization. I suspect that in the
vast majority of cases it is better to let the filesystem lay things
out for you. (In an ideal world the filesystem would be able to
monitor disk activity and move things around when it finds one spindle
more heavily used than another.)
- Garrett
On Sun, 2010-08-29 at 15:24 -0400, Kyle McDonald wrote:
On 8/29/2010 2:53 PM, Haudy Kazemi wrote:
RAID0 = striping
JBOD = straight concatenation
Neither has any redundancy, however the potential impact of a failure
is different. JBOD failure has the potential of being less severe
than RAID0 failure. With JBOD, most likely you will only lose the
content of the single drive that failed (the remaining content has some
chance of being recoverable). With RAID0, you lose everything larger
than the stripe width, which means any medium or large files, because
they have been striped across multiple drives. The smaller files fit
within a stripe, so they should still be recoverable assuming the
drive they ended up on is still working. (Actually, with RAID0, a
failed drive just about guarantees your medium and large files have
holes in them, while with JBOD those files might have holes in them
because of fragmentation.)
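The difference can be sketched with a small, hypothetical simulation
(the file names, sizes, two-disk geometry, and sequential allocation
are all made up for illustration; real filesystems allocate far less
tidily): lay the same files out as RAID0 stripes and as a
concatenation, fail disk 0, and see which files remain whole.

```python
# Hypothetical sketch: file survival after losing disk 0 in a two-disk
# RAID0 stripe vs. a simple concatenation. All sizes are in blocks.
DISKS = 2
DISK_BLOCKS = 25  # capacity of each member disk, in blocks

# Invented files, allocated back to back: name -> size in blocks.
files = {"a": 10, "b": 10, "c": 10, "d": 11, "e": 1}

def raid0_disks(start, size):
    """Disks holding any block of a file striped round-robin from 'start'."""
    return {(start + b) % DISKS for b in range(size)}

def concat_disks(start, size):
    """Disks holding any block of a file laid out contiguously from 'start'."""
    return {(start + b) // DISK_BLOCKS for b in range(size)}

def survivors(layout, failed_disk=0):
    """Names of files with no block on the failed disk."""
    result, start = [], 0
    for name, size in files.items():
        if failed_disk not in layout(start, size):
            result.append(name)
        start += size
    return result

print("RAID0 survivors: ", survivors(raid0_disks))
print("concat survivors:", survivors(concat_disks))
```

Under this toy layout, RAID0 keeps only the single-block file that
happened to land on the surviving spindle, while concatenation also
keeps every file that fit entirely on the surviving disk.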
Some caveats that apply are the effects of file fragmentation and the
potential loss of filesystem tables/metadata. In either case, if you
lose the filesystem tables/metadata, you will need to file carve out
anything that remains, and file carving doesn't work very well on
fragmented files.
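To illustrate why carving struggles with fragmentation, here is a
hypothetical minimal carver (the magic bytes are the real JPEG
start/end markers, but the in-memory "image" and the fragmented file
are invented): it grabs everything from a start-of-image signature to
the next end marker, which silently splices foreign data into the
result when the file is fragmented.

```python
# Minimal signature-based carver sketch. JPEG data starts with FF D8 FF
# and ends with FF D9; a naive carver takes everything in between.
SOI = b"\xff\xd8\xff"   # JPEG start-of-image signature
EOI = b"\xff\xd9"       # JPEG end-of-image marker

def carve_jpegs(image: bytes):
    """Return byte runs that look like JPEGs in a raw disk image."""
    found, pos = [], 0
    while (start := image.find(SOI, pos)) != -1:
        end = image.find(EOI, start)
        if end == -1:
            break  # truncated file: no end marker before image runs out
        found.append(image[start:end + len(EOI)])
        pos = end + len(EOI)
    return found

# Contiguous file: carved back correctly.
contig = b"junk" + SOI + b"AAAA" + EOI + b"junk"
# Fragmented file: unrelated blocks sit between the two halves, so the
# carver returns a corrupt run containing the foreign data.
frag = b"junk" + SOI + b"AA" + b"OTHERDATA" + b"AA" + EOI
print(carve_jpegs(contig))
print(carve_jpegs(frag))
```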
The idea that the data on one disk would still be recoverable seems a
stretch to me. While it may be readable, in my experience with SVM
accessing the data is not going to be simple - SVM isn't going to help
you out, though dd might. On top of that, while it's not striped in the
regular way, there is still no guarantee that all the blocks of the file
you're interested in will be on the surviving disk. UFS tries to do that
somewhat, but on a long-lived FS its ability to do that will be
limited. Even if a file is all on one disk, you have no easy way of
knowing which disk it's on.
With a concatenated array of disks (assuming zero fragmentation and no
loss of important metadata), you will lose whatever files were on the
failed disk. You don't get to choose the files that survive...all have
an approximately equal chance of being lost regardless of which ones you
are more interested in. If the filesystem tables/metadata are intact,
you will know which files are affected by looking up the block addresses
associated with the file and then seeing which disk those translate to.
If the filesystem tables/metadata is lost, you'll get back whatever the
file carving software can find using file type signatures and heuristics
of where the file ends.
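That lookup amounts to checking each of a file's block addresses
against the cumulative sizes of the member disks. A hypothetical
sketch (the disk sizes and block ranges are invented):

```python
from bisect import bisect_right
from itertools import accumulate

# Hypothetical concat of three member disks, sizes in blocks.
disk_sizes = [100, 200, 150]
boundaries = list(accumulate(disk_sizes))  # cumulative ends: [100, 300, 450]

def disk_of(block):
    """Index of the member disk holding an absolute block address."""
    return bisect_right(boundaries, block)

def affected(file_blocks, failed_disk):
    """True if any of the file's blocks lived on the failed disk."""
    return any(disk_of(b) == failed_disk for b in file_blocks)

# A file occupying blocks 95-105 straddles disks 0 and 1, so it is
# damaged if either of those disks fails, but not if disk 2 does.
print(affected(range(95, 106), 0))  # True
print(affected(range(95, 106), 2))  # False
```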
So the net effect to me isn't that great. I've always stayed away from
both RAID0 and concatenation. While it does decrease the flexibility of
space usage, if I've had multiple disks and didn't want to have
redundancy and didn't need the performance boost, I've always just
partitioned and made a FS on each disk and mounted them on the system.
That's really the only way to salvage one disk's worth of data when
another one fails. It's the only way to know which files are on which
disk, and to ensure that each file is completely on one disk.
I agree. That is a strategy I myself have used for storing replaceable
or low-value data, where losing one disk's worth of data has a tolerable
time/hassle/annoyance factor, but replacing many disks' worth would have
an unacceptable one.
_______________________________________________
Discuss mailing list
[email protected]
http://lists.illumos.org/m/listinfo/discuss