On 2/15/2011 1:37 PM, Torrey McMahon wrote:
> On 2/14/2011 10:37 PM, Erik Trimble wrote:
>> That said, given that SAN NVRAM caches are true write caches (and not
>> a ZIL-like thing), it should be relatively simple to swamp one with
>> write requests (most SANs have little more than 1GB of cache), at
>> which point, the SAN will be blocking on flushing its cache to disk.
> Actually, most array controllers now have 10s if not 100s of GB of
> cache. The 6780 has 32GB, the DMX-4 has - if I remember correctly - 256GB.
> The latest HDS box is probably close if not more.
> Of course you still have to flush to disk, and the cache flush
> algorithms of the boxes themselves come into play, but 1GB was a long
> time ago.
The STK2540 and STK6140 have at most 1GB of cache; the STK6180 has 4GB.
The move to large caches is only recent - only big setups (i.e. large
arrays with a dedicated SAN head) have had multi-GB NVRAM caches for any
length of time.
In particular, pretty much all base arrays still have 4GB or less on the
enclosure controller - you only find the big multi-GB caches in the SAN
heads. And lots (I'm going to be brave and say the vast majority) of
ZFS deployments use direct-attach arrays or internal storage, rather
than large SAN configs. Lots of places with older SAN heads are also
going to have much smaller caches. Given the price tag of most large
SANs, I'm thinking that there are still huge numbers of 5+ year-old SANs
out there, and practically all of them have a dozen GB of cache or less.
So, yes, modern big-SAN configurations have lots of cache. But they're
also the ones most likely to be hammered with huge amounts of I/O from
multiple machines, which makes it relatively easy to blow through
the cache capacity and slow I/O back down to disk speed.
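A quick back-of-envelope sketch of the swamping argument. The numbers
below (cache sizes, ingest and flush rates) are illustrative assumptions,
not measurements from any particular array:

```python
# Rough model: a write cache buys you burst headroom only while the
# hosts push data faster than the disks can drain it. All rates are
# assumed, illustrative values.

def seconds_to_fill(cache_gb, ingest_mb_s, flush_mb_s):
    """Time until the cache is full and writes drop to disk speed.

    Returns None if the disks keep up (the cache never fills).
    """
    net_mb_s = ingest_mb_s - flush_mb_s  # rate the cache actually grows
    if net_mb_s <= 0:
        return None
    return cache_gb * 1024 / net_mb_s

# A 4GB controller cache, hosts pushing 800 MB/s, disks draining 400 MB/s:
print(seconds_to_fill(4, 800, 400))    # ~10 seconds of burst headroom
# A 256GB SAN-head cache under the same load:
print(seconds_to_fill(256, 800, 400))  # ~11 minutes before hitting disk speed
```

The point being: even a very large cache only delays the fall back to
raw disk speed under sustained multi-host I/O; it doesn't prevent it.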
Once you get back down to raw disk speed, having multiple LUNs per RAID
array is almost certainly going to perform worse than a single LUN, due
to thrashing. That is, it would certainly be better (i.e. faster) for
an array to have to commit one 128k slab than four 32k slabs.
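The thrashing argument comes down to seek overhead: four independent
commits to the same spindles pay four seeks. A minimal sketch, with
assumed (typical-ish 7200rpm) seek and transfer numbers:

```python
# Assumed figures - not measured from any specific disk or array.
SEEK_MS = 8.0          # average seek + rotational latency
TRANSFER_MB_S = 100.0  # sequential transfer rate

def commit_ms(slab_kb, n_slabs):
    """Total time to commit n_slabs independent slabs of slab_kb each,
    charging one seek per slab."""
    transfer_ms = slab_kb / 1024 / TRANSFER_MB_S * 1000
    return n_slabs * (SEEK_MS + transfer_ms)

print(commit_ms(128, 1))  # one 128k slab: ~9.25 ms
print(commit_ms(32, 4))   # four 32k slabs: ~33.25 ms - seeks dominate
```

Same 128k of data either way; the multi-LUN layout loses purely on the
extra head movement.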
So, the original recommendation is interesting, but needs the caveat
that you'd really only use it if you can limit the amount of sustained
I/O, or are using very-large-cache disk setups.
I would think the idea might also apply (i.e. be useful) to something
like the F5100 or similar RAM/Flash arrays.
--
Erik Trimble
Java System Support
Mailstop: usca22-123
Phone: x17195
Santa Clara, CA
_______________________________________________
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss