James G. Sack (jim) wrote:

> My understanding is that software raids do parallel access effectively
> the same as hardware controllers. But hw offloads the parity and error

This isn't necessarily true; the reason is below, though it takes a bit of explanation. Basically, it comes down to the way most common architectures (especially x86 and x86_64) handle hardware interrupts.

The way most proper RAID controllers are built involves a bit of encapsulation. First, there's a device sitting on the host's PCI bus that serves as the interface between the OS and the RAID function. That device normally carries a fast, specialized CPU which does all the xor/checksum/DMA/IO-operation reordering, etc. This CPU will typically have its own memory and its own PCI bus, and on that bus sit standard (or sometimes specialized) interface chipsets which actually talk to the disks in question.

Here's where the difference in parallelism comes out: the specialized CPU on the RAID board can handle the interrupts given to it far more efficiently. These CPUs normally have some sort of vectored interrupt table, so they can effectively service interrupts in parallel. x86 hardware in particular lacks this; the IOAPIC is a poor substitute for doing it properly, and a given CPU can still only service interrupts sequentially. Not to mention that the host CPU has to compete with things like, oh, userland processes that may be feeding data to the disks as fast as possible. Or the user moving the mouse across the screen, resizing a window, whatever. Read: it has better things to do.
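If you want to see that contention on a software RAID box, you can watch how the storage controller's interrupts pile up on the host CPUs. Here's a rough Python sketch that totals them up from /proc/interrupts; the "ahci" string is just an example, so substitute whatever name your SATA/SAS controller shows up under.

#!/usr/bin/env python3
"""Rough sketch: total up the storage interrupts each host CPU has serviced.

Assumes the standard Linux /proc/interrupts layout and that the controller's
lines can be spotted by a substring ("ahci" here is only an example)."""

SUBSTRING = "ahci"  # hypothetical; use whatever your controller reports

def storage_irq_counts(path="/proc/interrupts", match=SUBSTRING):
    with open(path) as f:
        cpus = f.readline().split()          # header row: CPU0 CPU1 ...
        totals = [0] * len(cpus)
        for line in f:
            if match not in line:
                continue
            fields = line.split()
            # fields[0] is "NN:"; the next len(cpus) fields are per-CPU counts
            for i, count in enumerate(fields[1:1 + len(cpus)]):
                if count.isdigit():
                    totals[i] += int(count)
    return dict(zip(cpus, totals))

if __name__ == "__main__":
    for cpu, count in storage_irq_counts().items():
        print(f"{cpu}: {count} storage interrupts serviced")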

Not only this, but I mentioned IO operation reordering. The hardware controllers are *very* good about understanding that seeks are bad and should be avoided at all costs, so they internally reorder all the IO operations sent to them before flushing to disk or doing a media read. Your Linux kernel also does this, and it has several flavours available:

- cfq (completely fair queueing), the default since 2.6.18, is a good mix of performance for multi-use systems.

- as (anticipatory): when an IO is performed, this scheduler waits a small period to see if more IOs are coming to the same area of the media; if so, it services those before servicing anything else. It's great for desktop systems that typically do one thing at a time, and terrible for multi-use systems because it leads to IO starvation fairly quickly.

- deadline is great for database loads and mixed-mode access. IOs come in and are set aside, but given a hard deadline for completion. More IOs come in and the scheduler reorders them; when the timer is up for a pending operation, it schedules all IOs in the same area to go to disk at once. Since there's a guarantee that IOs will be serviced within a certain amount of time, you don't get the starvation you see with as. It's a great scheduler for file servers and database boxes.

- noop is exactly what the name says: a no-op. IOs are flushed to or read from media as soon as they're received(*), without being reordered. It's great for devices with no seek penalty (like flash). It's also great for hardware RAID controllers, because their scheduling is done on-board; there's no reason for the host CPU to reorder IOs the board is just going to reorder again as soon as it receives them.

Better yet, you can select which IO scheduler to use both at boot time (elevator=name on the kernel command line) and on the fly via sysfs (since 2.6.17, /sys/block/<device>/queue/scheduler controls this).

(*) this isn't entirely true; Linux has an aggressive write-behind caching layer.
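For the curious, here's a minimal sketch of poking at that sysfs interface from Python; "sda" is only an example device, and writing the scheduler file needs root.

#!/usr/bin/env python3
"""Minimal sketch: inspect and switch a block device's IO scheduler via
sysfs. The boot-time equivalent is elevator=<name> on the kernel command
line. "sda" is just an example; writing the file requires root."""

import sys

def current_and_available(dev="sda"):
    path = f"/sys/block/{dev}/queue/scheduler"
    with open(path) as f:
        # Reads something like: "noop anticipatory deadline [cfq]"
        names = f.read().split()
    current = next(n.strip("[]") for n in names if n.startswith("["))
    available = [n.strip("[]") for n in names]
    return current, available

def set_scheduler(dev, name):
    # Writing a scheduler name into the file switches it on the fly.
    with open(f"/sys/block/{dev}/queue/scheduler", "w") as f:
        f.write(name)

if __name__ == "__main__":
    dev = sys.argv[1] if len(sys.argv) > 1 else "sda"
    cur, avail = current_and_available(dev)
    print(f"{dev}: current={cur}, available={avail}")
    # e.g. on a hardware RAID volume you might do: set_scheduler(dev, "noop")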

> checking/handling onto the cpu in the controller, thus gaining performance..

I've seen throughput increase two to three times, and system load decrease correspondingly, just by switching from software to hardware RAID. What makes this especially funny is that the disks didn't change at all, nor did the controller chips directly accessing them (I was using Marvell SATA controllers on the host; it turns out my RAID board of choice also uses these internally).
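As an aside, the parity work being offloaded is conceptually simple; here's a toy sketch of RAID-5-style parity, just to show the sort of per-write arithmetic the host CPU gets to skip when the board does it. The chunk contents are made up for illustration.

#!/usr/bin/env python3
"""Toy illustration of the parity work a RAID controller offloads: for a
RAID-5 style stripe, parity is the XOR of the data chunks, and any one lost
chunk can be rebuilt by XOR-ing the survivors. With software RAID the host
CPU does this for every write; with hardware RAID the board's CPU does."""

def xor_parity(chunks):
    # Byte-wise XOR across all chunks in a stripe.
    parity = bytearray(len(chunks[0]))
    for chunk in chunks:
        for i, b in enumerate(chunk):
            parity[i] ^= b
    return bytes(parity)

if __name__ == "__main__":
    stripe = [b"AAAA", b"BBBB", b"CCCC"]      # three data chunks (made up)
    p = xor_parity(stripe)
    # "Lose" the second chunk and rebuild it from the survivors plus parity.
    rebuilt = xor_parity([stripe[0], stripe[2], p])
    assert rebuilt == stripe[1]
    print("rebuilt chunk:", rebuilt)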

> - a. protection from damaging meddling (direct disk access) from other
>   sw on the host

Right. You can easily dd over the wrong disk with a misplaced argument. No amount of software RAID will prevent this.
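If you want a seatbelt, a tiny script can at least refuse to touch a device that's currently mounted. This is only an illustration of the point (it won't notice a disk that's an md member or an LVM PV, for example), and /dev/sdX is a placeholder.

#!/usr/bin/env python3
"""Toy sanity check before doing something destructive (dd, mkfs, ...) to a
block device: refuse if the device or one of its partitions is mounted."""

import sys

def is_mounted(device):
    # /proc/mounts lists "<source> <mountpoint> <fstype> ..." per line.
    with open("/proc/mounts") as f:
        for line in f:
            source = line.split()[0]
            if source == device or source.startswith(device):
                return True
    return False

if __name__ == "__main__":
    target = sys.argv[1] if len(sys.argv) > 1 else "/dev/sdX"  # placeholder
    if is_mounted(target):
        sys.exit(f"{target} (or a partition on it) is mounted; refusing.")
    print(f"{target} does not appear to be mounted; proceed with caution.")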

> & (usually)

> - b. prevention of debugging, monitoring or exploratory recovery operations.

Right; this is all handled in the card's own ROM/firmware.



However, this isn't to completely discount software RAID; it definitely has its place. I just don't use it where extremely high performance is required. Often, my systems have better things to do than service storage interrupts all day long and reorder IOs they don't need to reorder.

-kelsey


