[ 
https://issues.apache.org/jira/browse/CASSANDRA-11035?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15131512#comment-15131512
 ] 

Jonathan Ellis commented on CASSANDRA-11035:
--------------------------------------------

The problem here was that you end up doing quadratic work comparing every 
sstable to every other one to find the best candidates to merge.  So the 
question is: do we try to come up with a clever way to avoid this, or do we go 
ahead and brute-force it, which would require updating HyperLogLog to use 
off-heap registers?  (The latter actually looks pretty easy, now that I check 
the source.)
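
To make the quadratic cost concrete, here is a rough sketch of the brute-force 
approach. It is an illustration only, not Cassandra code: the function names 
and the sample data are hypothetical, and plain Python sets stand in for the 
per-sstable HyperLogLog sketches (a real version would merge two sketches to 
estimate the union cardinality rather than count exact keys). The overlap is 
derived by inclusion-exclusion, and every pair of sstables is scored, which is 
n*(n-1)/2 comparisons.

```python
from itertools import combinations


def overlap_ratio(keys_a, keys_b):
    """Fraction of the smaller sstable's keys that also appear in the other.

    Exact sets are an assumption for illustration; with HyperLogLog the
    three cardinalities below would be estimates.
    """
    union = len(keys_a | keys_b)
    # Inclusion-exclusion: |A n B| = |A| + |B| - |A u B|
    intersection = len(keys_a) + len(keys_b) - union
    smaller = min(len(keys_a), len(keys_b))
    return intersection / smaller if smaller else 0.0


def best_pair(sstables):
    """Brute force: score every pair and return the highest-overlap one.

    Returns (pair_of_names, score, number_of_comparisons).
    """
    names = sorted(sstables)
    best, best_score, comparisons = None, -1.0, 0
    for name_a, name_b in combinations(names, 2):
        comparisons += 1
        score = overlap_ratio(sstables[name_a], sstables[name_b])
        if score > best_score:
            best, best_score = (name_a, name_b), score
    return best, best_score, comparisons


# Hypothetical partition keys per sstable.
sample = {
    "a": {1, 2, 3, 4},
    "b": {3, 4, 5, 6},
    "c": {1, 2, 3, 9},
    "d": {100, 101},
}
print(best_pair(sample))
```

With four sstables this already takes six comparisons; the count grows 
quadratically with the number of sstables, which is the cost a cleverer 
candidate selection would need to avoid.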

> Use cardinality estimation to pick better compaction candidates for STCS 
> (SizeTieredCompactionStrategy)
> -------------------------------------------------------------------------------------------------------
>
>                 Key: CASSANDRA-11035
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-11035
>             Project: Cassandra
>          Issue Type: Improvement
>          Components: Compaction
>            Reporter: Wei Deng
>
> This was initially mentioned in this blog post 
> http://www.datastax.com/dev/blog/improving-compaction-in-cassandra-with-cardinality-estimation
> but I couldn't find any existing JIRA for it. As stated by [~jbellis], 
> "Potentially even more useful would be using cardinality estimation to pick 
> better compaction candidates. Instead of blindly merging sstables of a 
> similar size a la SizeTieredCompactionStrategy." The L0 STCS in LCS should 
> benefit as well.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
