Re: [PERFORM] SSD + RAID

Greg Smith Mon, 01 Mar 2010 22:14:12 -0800

Bruce Momjian wrote:

I always assumed SCSI disks had a write-through cache and therefore
didn't need a drive cache flush comment.

There's more detail on all this mess athttp://wiki.postgresql.org/wiki/SCSI_vs._IDE/SATA_Disks and it includesthis perception, which I've recently come to believe isn't actuallycorrect anymore. Like the IDE crowd, it looks like one day somebodysaid "hey, we lose every write heavy benchmark badly because we onlyhave a write-through cache", and that principle got lost along thewayside. What has been true, and I'm staring to think this is whatwe've all been observing rather than a write-through cache, is that theproper cache flushing commands have been there in working form for somuch longer that it's more likely your SCSI driver and drive do theright thing if the filesystem asks them to. SCSI SYNCHRONIZE CACHE hasa much longer and prouder history than IDE's FLUSH_CACHE and SATA'sFLUSH_CACHE_EXT.

It's also worth noting that many current SAS drives, the current SCSIincarnation, are basically SATA drives with a bridge chipset stuck ontothem, or with just the interface board swapped out. This one reason whytop-end SAS capacities lag behind consumer SATA drives. They use theconsumers as beta testers to get the really fundamental firmware issuessorted out, and once things are stable they start stamping out theversion with the SAS interface instead. (Note that there's a parallelmanufacturing approach that makes much smaller SAS drives, the 2.5"server models or those at higher RPMs, that doesn't go through thispath. Those are also the really expensive models, due to economy ofscale issues). The idea that these would have fundamentally differentwrite cache behavior doesn't really follow from that development model.

At this point, there are only two common differences between "consumer"and "enterprise" hard drives of the same size and RPM when there aredirectly matching ones:

1) You might get SAS instead of SATA as the interface, which providesthe more mature command set I was talking about above--and therefore maygive you a sane write-back cache with proper flushing, which is all thedatabase really expects.

2) The timeouts when there's a read/write problem are tuned down in theenterprise version, to be more compatible with RAID setups where youwant to push the drive off-line when this happens rather than presumingyou can fix it. Consumers would prefer that the drive spent a lot oftime doing heroics to try and save their sole copy of the apparentlymissing data.

You might get a slightly higher grade of parts if you're lucky too; Iwouldn't count on it though. That seems to be saved for the high RPM orsmaller size drives only.


--
Greg Smith  2ndQuadrant US  Baltimore, MD
PostgreSQL Training, Services and Support
[email protected]   www.2ndQuadrant.us


--
Sent via pgsql-performance mailing list ([email protected])
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-performance

Re: [PERFORM] SSD + RAID

Reply via email to