[ https://issues.apache.org/jira/browse/CASSANDRA-8460?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16414547#comment-16414547 ]
Ben Slater commented on CASSANDRA-8460: --------------------------------------- Hi John I've been setting the requirements from our (Instaclustr) point of view for Lerh here so I thought I'd weigh in on why I'd rather see a Cassandra based solution than LVM. The requirement we're looking to target, as per the original JIRA, is people who have data that is hot for a short period but then they need to keep around for a long time with infrequent access (ie well defined rules on hot vs cold, not deciding what is hot based on what was recently read). Typically when I've seen this requirement people want: 1) The best possible performance for the hot data 2) Lowest cost of storage for the cold data It seems to me that with LVM we're a not doing the best we could in terms of either of these. For performance, there is the write-through slow down you mentioned, depending on where you draw the line on moving to slow disk vs the final TWCS compaction you might have compactions pushing data you want to be quick out of cache and if you used EBS for both the hot disk and the slow disk you are increasing usage of the EBS bandwidth to copy to and from cache (although using local SSD as the cache negates this last one). In terms of cost, with LVM the fast disk is purely being used as cache rather than a primary store so you are having to duplicate that amount of data storage - whether that is significant probably depends on your desired ratio of fast to slow disk and how cost sensitive you are. Whether this downsides are worth the extra complexity is of course a matter of judgement rather than facts so happy to go with the community consensus here but thought I'd put in my POV. Cheers Ben > Make it possible to move non-compacting sstables to slow/big storage in DTCS > ---------------------------------------------------------------------------- > > Key: CASSANDRA-8460 > URL: https://issues.apache.org/jira/browse/CASSANDRA-8460 > Project: Cassandra > Issue Type: Improvement > Reporter: Marcus Eriksson > Assignee: Lerh Chuan Low > Priority: Major > Labels: doc-impacting, dtcs > Fix For: 4.x > > > It would be nice if we could configure DTCS to have a set of extra data > directories where we move the sstables once they are older than > max_sstable_age_days. > This would enable users to have a quick, small SSD for hot, new data, and big > spinning disks for data that is rarely read and never compacted. -- This message was sent by Atlassian JIRA (v7.6.3#76005) --------------------------------------------------------------------- To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org