I'm playing around with an application that requires me to manage a large (multi-gigabyte to terabyte), bespoke, frequently-updating data structure in real time... my key concerns are durability and efficiency. While a traditional approach might be to employ an expensive DBMS on expensive hardware... I'm looking to be more innovative. I want to achieve big-iron-beating performance on a shoestring budget... and I'm optimistic, since the problem domain doesn't translate well to traditional RDBMS approaches.

An obvious alternative to a DBMS is to use the file-system directly... in principle this could work, but it would be a laborious process fraught with potential pitfalls with respect to atomicity of updates, transactional recovery (in case of a fail-stop while processing a large update) and so on. Another issue is that, in order to establish an efficient and reliable implementation, it becomes necessary to second-guess details of how file-systems are implemented... this vastly complicates any implementation and might render it unacceptably fragile (subject to unexpected deviations in behaviour as the implementation is moved between hardware, OS versions etc.).
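
For what it's worth, the pattern I'd expect to reach for first is the classic write-to-temporary, fsync, rename, fsync-the-directory dance, which gives atomic replacement of a file on POSIX filesystems without second-guessing filesystem internals. This is purely a sketch - the paths and payload are hypothetical and error handling is reduced to bail-on-failure:

#define _GNU_SOURCE
#include <fcntl.h>
#include <stdio.h>
#include <stdlib.h>
#include <string.h>
#include <unistd.h>

static void die(const char *msg) { perror(msg); exit(EXIT_FAILURE); }

int main(void)
{
    const char *tmp     = "data/index.tmp";   /* hypothetical paths */
    const char *final   = "data/index.dat";
    const char *payload = "new version of the structure\n";

    int fd = open(tmp, O_WRONLY | O_CREAT | O_TRUNC, 0644);
    if (fd < 0) die("open tmp");
    if (write(fd, payload, strlen(payload)) < 0) die("write");
    if (fsync(fd) < 0) die("fsync tmp");        /* contents durable before rename */
    if (close(fd) < 0) die("close tmp");

    if (rename(tmp, final) < 0) die("rename");  /* atomic replacement */

    /* fsync the containing directory so the rename itself is durable */
    int dfd = open("data", O_RDONLY | O_DIRECTORY);
    if (dfd < 0) die("open dir");
    if (fsync(dfd) < 0) die("fsync dir");
    close(dfd);
    return 0;
}

Whether that sort of whole-file replacement is workable for frequent incremental updates to a multi-gigabyte structure is doubtful, which is part of why I'm looking at the block-device level directly.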

I've recently discovered that SSDs are becoming more affordable... and this might present new options. There were major hurdles in attempting to establish a strategy for interacting with hard-disk block devices... including, but not limited to, a significant difficulty in establishing the extent to which locality of reference affected performance. Another worry was that it might be difficult to establish that a write had actually completed (i.e. that the data was reliably and durably stored, not just that responsibility for recording it now rested exclusively with the drive). My hope is that SSD technology simplifies some of these concerns, allowing a clear model of access performance that should permit an efficient and reliable implementation.
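
To make the durability question concrete, this is roughly what I have in mind at the block level (just a sketch; the target path and the 4KiB alignment are assumptions): open with O_DIRECT so the page cache stays out of the picture, then rely on fdatasync() to ask the kernel to flush the drive's write cache. What I can't easily establish is whether a given SSD actually honours that flush.

#define _GNU_SOURCE
#include <fcntl.h>
#include <stdio.h>
#include <stdlib.h>
#include <string.h>
#include <unistd.h>

#define TARGET "./ssd-scratch.img"   /* hypothetical file on the filesystem under test
                                        (O_DIRECT is not supported on e.g. tmpfs) */
#define BLOCK  4096                  /* assumed alignment and write size */

int main(void)
{
    /* O_DIRECT requires the buffer, offset and length to be block-aligned. */
    void *buf;
    if (posix_memalign(&buf, BLOCK, BLOCK) != 0) { perror("memalign"); return 1; }
    memset(buf, 0xAB, BLOCK);

    int fd = open(TARGET, O_WRONLY | O_CREAT | O_DIRECT, 0644);
    if (fd < 0) { perror("open"); return 1; }

    if (pwrite(fd, buf, BLOCK, 0) != BLOCK) { perror("pwrite"); return 1; }

    /* Only after fdatasync() returns has the kernel asked the drive to flush
       its volatile write cache; whether the drive honours that is the open
       question. */
    if (fdatasync(fd) < 0) { perror("fdatasync"); return 1; }

    close(fd);
    free(buf);
    return 0;
}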

I'd like to hear from anyone who has experience with configuring SSDs for use with (Gentoo) Linux, and especially from anyone who's investigated performance issues. I've read that SSDs typically have a 64KiB block size... this would work fine for me (though I understand it is a significant impediment to high performance with existing file systems). I'd be interested to know if anyone has done performance analysis of SSDs at the device level under Linux... and am intrigued whether there is more to interacting with them than establishing the block size from manufacturer data, then reading/writing appropriately many bytes from the block device and/or flushing appropriately aligned and sized blocks of memory-mapped data. For example, is there an interface to query an SSD about its block size? I'd also like to establish whether I can rely upon my data being durably stored on an SSD once a flush/write returns.
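
As far as I can tell, the kernel does expose the sizes the drive advertises - BLKSSZGET for the logical sector size and, on recent kernels, BLKPBSZGET for the physical block size (also visible under /sys/block/<dev>/queue/) - though I gather these are not necessarily the SSD's internal erase-block size. A minimal sketch, with the device node as a placeholder:

#include <fcntl.h>
#include <linux/fs.h>
#include <stdio.h>
#include <sys/ioctl.h>
#include <unistd.h>

int main(void)
{
    const char *dev = "/dev/sdb";          /* placeholder device node */
    int fd = open(dev, O_RDONLY);
    if (fd < 0) { perror("open"); return 1; }

    int logical = 0;
    unsigned int physical = 0;
    unsigned long long bytes = 0;

    if (ioctl(fd, BLKSSZGET, &logical) < 0)   { perror("BLKSSZGET");   return 1; }
    if (ioctl(fd, BLKPBSZGET, &physical) < 0) { perror("BLKPBSZGET");  return 1; }
    if (ioctl(fd, BLKGETSIZE64, &bytes) < 0)  { perror("BLKGETSIZE64"); return 1; }

    printf("%s: logical sector %d B, physical block %u B, capacity %llu B\n",
           dev, logical, physical, bytes);
    close(fd);
    return 0;
}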

In a practical sense, I'd like to experiment with some SSD hardware, but there seems to be a lot to choose from. For development purposes I'd not need more than, say, 32GB, and I'm not all that fussed about absolute performance, as long as the relative performance of the various interactions would scale proportionally were I to move to more expensive SSDs in future. I'm interested in any practical anecdotes (or hard statistical data) about the relative merits of the various interfaces for SSDs, and in establishing whether RAID needs to be taken into account when building a performance model.
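
Absent better data, I was planning to hack up something like the following rough micro-benchmark (not a rigorous methodology; the target is a placeholder device or large file) to compare sequential against random 4KiB reads and get a first-order feel for how much locality of reference matters on a given device:

#define _GNU_SOURCE
#include <fcntl.h>
#include <stdio.h>
#include <stdlib.h>
#include <time.h>
#include <unistd.h>

#define TARGET "/dev/sdb"    /* placeholder: device or large file under test */
#define BLOCK  4096
#define N      10000

static double now_s(void)
{
    struct timespec ts;
    clock_gettime(CLOCK_MONOTONIC, &ts);
    return ts.tv_sec + ts.tv_nsec / 1e9;
}

int main(void)
{
    void *buf;
    if (posix_memalign(&buf, BLOCK, BLOCK) != 0) return 1;

    int fd = open(TARGET, O_RDONLY | O_DIRECT);   /* O_DIRECT: bypass page cache */
    if (fd < 0) { perror("open"); return 1; }

    off_t span = lseek(fd, 0, SEEK_END);          /* size of device/file */
    long nblocks = span / BLOCK;
    if (nblocks <= 0) { fprintf(stderr, "target too small\n"); return 1; }

    double t0 = now_s();
    for (long i = 0; i < N; i++)                  /* sequential 4KiB reads */
        pread(fd, buf, BLOCK, (off_t)(i % nblocks) * BLOCK);
    double seq = now_s() - t0;

    t0 = now_s();
    for (long i = 0; i < N; i++)                  /* random 4KiB reads */
        pread(fd, buf, BLOCK, (off_t)(rand() % nblocks) * BLOCK);
    double rnd = now_s() - t0;

    printf("sequential: %.3f s, random: %.3f s (%d x %d B reads)\n",
           seq, rnd, N, BLOCK);
    close(fd);
    free(buf);
    return 0;
}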

Any feedback would be appreciated... especially from any gentooist who is interested in SSD performance/reliability/configuration.


