[ 
https://issues.apache.org/jira/browse/CASSANDRA-11035?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15154579#comment-15154579
 ] 

Tyler Hobbs commented on CASSANDRA-11035:
-----------------------------------------

In combination with a minimum threshold, we could also potentially increase the 
sstable size variation within a bucket.  In other words, instead of requiring 
sstables in a bucket to be within 50% of the average size, allow them to be 
within 75% of the average size.  If we know that sstables have a lot of 
overlap, it makes sense to be a little more flexible about the sizes.

However, if we do implement a minimum threshold, we should probably add a cap 
on the max number of sstables (for STCS, at least).  Otherwise, you might see 
unbounded sstable growth for something like a Users table.

> Use cardinality estimation to pick better compaction candidates for STCS 
> (SizeTieredCompactionStrategy)
> -------------------------------------------------------------------------------------------------------
>
>                 Key: CASSANDRA-11035
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-11035
>             Project: Cassandra
>          Issue Type: Improvement
>          Components: Compaction
>            Reporter: Wei Deng
>            Assignee: Marcus Eriksson
>
> This was initially mentioned in this blog post 
> http://www.datastax.com/dev/blog/improving-compaction-in-cassandra-with-cardinality-estimation
>  but I couldn't find any existing JIRA for it. As stated by [~jbellis], 
> "Potentially even more useful would be using cardinality estimation to pick 
> better compaction candidates. Instead of blindly merging sstables of a 
> similar size a la SizeTieredCompactionStrategy." The L0 STCS in LCS should 
> benefit as well.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to