[ 
https://issues.apache.org/jira/browse/CASSANDRA-8460?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16414547#comment-16414547
 ] 

Ben Slater commented on CASSANDRA-8460:
---------------------------------------

Hi John

I've been setting the requirements from our (Instaclustr) point of view for 
Lerh here so I thought I'd weigh in on why I'd rather see a Cassandra based 
solution than LVM.

The requirement we're looking to target, as per the original JIRA, is people 
who have data that is hot for a short period but then they need to keep around 
for a long time with infrequent access (ie well defined rules on hot vs cold, 
not deciding what is hot based on what was recently read).

Typically when I've seen this requirement people want: 
1) The best possible performance for the hot data
2) Lowest cost of storage for the cold data

It seems to me that with LVM we're a not doing the best we could in terms of 
either of these.

For performance, there is the write-through slow down you mentioned, depending 
on where you draw the line on moving to slow disk vs the final TWCS compaction 
you might have compactions pushing data you want to be quick out of cache and 
if you used EBS for both the hot disk and the slow disk you are increasing 
usage of the EBS bandwidth to copy to and from cache (although using local SSD 
as the cache negates this last one).

In terms of cost, with LVM the fast disk is purely being used as cache rather 
than a primary store so you are having to duplicate that amount of data storage 
- whether that is significant probably depends on your desired ratio of fast to 
slow disk and how cost sensitive you are.

Whether this downsides are worth the extra complexity is of course a matter of 
judgement rather than facts so happy to go with the community consensus here 
but thought I'd put in my POV.

 

Cheers

Ben

> Make it possible to move non-compacting sstables to slow/big storage in DTCS
> ----------------------------------------------------------------------------
>
>                 Key: CASSANDRA-8460
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-8460
>             Project: Cassandra
>          Issue Type: Improvement
>            Reporter: Marcus Eriksson
>            Assignee: Lerh Chuan Low
>            Priority: Major
>              Labels: doc-impacting, dtcs
>             Fix For: 4.x
>
>
> It would be nice if we could configure DTCS to have a set of extra data 
> directories where we move the sstables once they are older than 
> max_sstable_age_days. 
> This would enable users to have a quick, small SSD for hot, new data, and big 
> spinning disks for data that is rarely read and never compacted.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org
For additional commands, e-mail: commits-h...@cassandra.apache.org

Reply via email to