Avi Kivity wrote:
Ric Wheeler wrote:
You want to have spare capacity, enough for one or two (or fifteen)
drives' worth of data. When a drive goes bad, you rebuild into the
spare capacity you have.
That is a different model (and one that makes sense; we used it in
Centera for object-level protection schemes). It is a nice model as
well, but not how most storage works today.
Well, btrfs is not about duplicating how most storage works today.
Spare capacity has significant advantages over spare disks, such as
being able to mix disk sizes, RAID levels, and better performance.
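The mixed-disk-size point can be made concrete with a quick sketch (plain Python arithmetic, illustrative only, not btrfs code): in the spare-capacity model you reserve enough free space across the pool to re-replicate whatever the largest drive holds, instead of idling a whole disk that must be as big as your largest data drive.

```python
# Illustrative sketch, not btrfs code: spare capacity vs. a dedicated
# hot spare in a pool of mixed drive sizes.

def spare_capacity_reserve(drive_sizes_gb):
    """Reserve enough free space, spread across all drives, so that
    the largest drive's contents can be rebuilt onto the survivors
    after it fails. Returns (reserved_gb, usable_gb)."""
    worst_loss = max(drive_sizes_gb)     # a failure can cost at most this
    total = sum(drive_sizes_gb)
    usable = total - worst_loss          # capacity left for live data
    return worst_loss, usable

# Mixed-size pool: a dedicated spare would have to match the 2 TB
# drives (2 TB sitting idle); spare capacity spreads that same 2 TB
# reserve over all four spindles, which also helps performance.
drives = [1000, 1000, 2000, 2000]        # GB
reserve, usable = spare_capacity_reserve(drives)
print(reserve, usable)                   # 2000 4000
```

The reserve here ignores redundancy overhead on the rebuilt copies; it is only meant to show why per-drive free space composes across unequal disk sizes where a fixed spare disk does not.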
Sure, each approach has advantages over the other. But btrfs is also
about being able to use common hardware configurations without having
to reinvent what we can avoid (if we have a working RAID array, or
enough drives to do RAID5 with spares or RAID6, we want to be able to
delegate that work to something else when we can).
When you replace the drive, the filesystem moves data into the new
drive to take advantage of the new spindle.
When you buy a storage solution (hardware or software), the key here
is "utilized capacity." If you have an enclosure that can host, say,
12-15 drives in a 2U enclosure, people normally leave one drive as a
spare. RAID6 is another way to do this. You can do two 4+2 groups
with 66% utilized capacity in RAID6, or possibly a RAID5 scheme using
5+1 and 4+1 with one global spare (75% utilized capacity).
That gives you the chance to rebuild your RAID group without having
to physically visit the data center. You can also do fancy stuff with
the spare (like migrate as many blocks as possible before the RAID
rebuild to that spare), which reduces your exposure to a second drive
failure and speeds up your rebuild time.
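The exposure argument behind pre-migrating blocks to the spare can be sketched with a rough model (Python, with illustrative rates that are assumptions, not measurements): blocks already copied to the spare never lose redundancy, so only the remainder has to survive the parity-rebuild window.

```python
# Rough model of the "migrate blocks to the spare before the rebuild"
# trick. The 50 MB/s rebuild rate is an illustrative assumption.

def rebuild_exposure_hours(drive_tb, migrated_fraction, rebuild_mb_s=50):
    """Time spent without full redundancy after a drive fails.
    Blocks pre-migrated to the spare are already safe; the parity
    rebuild only has to regenerate what was not copied in time."""
    remaining_tb = drive_tb * (1 - migrated_fraction)
    return remaining_tb * 1e6 / rebuild_mb_s / 3600   # TB -> MB -> hours

print(round(rebuild_exposure_hours(1.0, 0.0), 1))  # ~5.6 h: full parity rebuild
print(round(rebuild_exposure_hours(1.0, 0.8), 1))  # ~1.1 h: 80% pre-migrated
```

The point is only the proportionality: every block you manage to copy before the drive dies outright is a block the second-failure window no longer covers.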
In the end, whether you use a block based RAID solution or an object
based solution, you just need to figure out how to balance your
utilized capacity against performance and data integrity needs.
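The utilized-capacity figures quoted above for a 12-drive shelf can be checked with a few lines (plain Python, layout names are my own):

```python
# Check of the utilized-capacity figures for a 12-drive enclosure.

def utilized(groups, spares=0):
    """groups: list of (data_drives, parity_drives) per RAID group;
    spares: count of idle global spare drives."""
    data = sum(d for d, p in groups)
    total = sum(d + p for d, p in groups) + spares
    return data / total

print(utilized([(4, 2), (4, 2)]))            # two RAID6 4+2 groups -> 0.666...
print(utilized([(5, 1), (4, 1)], spares=1))  # RAID5 5+1 and 4+1 + spare -> 0.75
```

Both match the numbers in the message: 8 of 12 drives carry data in the RAID6 layout, 9 of 12 in the RAID5-plus-spare layout.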
In both models (spare disk and spare capacity) the storage utilization
is the same, or nearly so. But with spare capacity you get better
performance, since you have more spindles seeking for your data, and
since less of the disk surface is occupied by data, your seeks are
shorter.
True, you can get more performance if you use all of the hardware you
have all of the time.
The major difficulty with the spare capacity model is that your recovery
is not as simple and well understood as RAID rebuilds. If you assume
that whole drives fail under btrfs mirroring, you are not really doing
anything more than simple RAID, or do I misunderstand your suggestion?
I don't see the point about head seeking. In RAID you have the same
layout, so you minimize head movement just as well (you just move more
heads per IO in parallel).
ric
--
To unsubscribe from this list: send the line "unsubscribe linux-btrfs" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at http://vger.kernel.org/majordomo-info.html