[ 
https://issues.apache.org/jira/browse/CASSANDRA-5515?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13714563#comment-13714563
 ] 

Jonathan Ellis commented on CASSANDRA-5515:
-------------------------------------------

Either time-based throttle or sync-on-compaction seems reasonable to me.  
Leaning towards the former since write:read ratio may be low.

If we're not going to take the simple approach with a map, should we keep more 
data like this?

{code}
CREATE TABLE sstable_activity (
    keyspace text,
    columnfamily text,
    generation int,
    hour_ending_at timestamp,
    reads int,
    PRIMARY KEY ((keyspace, columnfamily, generation), hour_ending_at)
)
{code}

Delete-on-removal + TTL sounds right.
                
> Track sstable coldness
> ----------------------
>
>                 Key: CASSANDRA-5515
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-5515
>             Project: Cassandra
>          Issue Type: New Feature
>          Components: Core
>            Reporter: Jonathan Ellis
>            Assignee: Tyler Hobbs
>             Fix For: 2.0.1
>
>         Attachments: 0001-Track-row-read-counts-in-SSTR.patch
>
>
> Keeping a count of reads per-sstable would allow STCS to automatically ignore 
> cold data rather than recompacting it constantly with hot data, dramatically 
> reducing compaction load for typical time series applications and others with 
> time-correlated access patterns.  We would not need a separate age-tiered 
> compaction strategy.
> (This will really be useful in conjunction with CASSANDRA-5514.)

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to