[ 
https://issues.apache.org/jira/browse/CASSANDRA-4234?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Yuki Morishita updated CASSANDRA-4234:
--------------------------------------

    Attachment: 4234.txt

(Attaching modified and rebased patch)

So, after running some benchmark, I concluded that getOverlappingSSTables is 
not going to be a bottleneck. After all it's O(log n) operation.

But the problem rises if you iterate on 50K (or large number of) sstables. So I 
modified the patch to first sort sstables by droppable ratio in descent order, 
and skip iteration if it finds the ratio below threshold. I think this feature 
combined with the nature of LCS (fewer overlap among sstables) prevent a lot of 
calculation in findDroppableSSTable.
                
> Add tombstone-removal compaction to LCS
> ---------------------------------------
>
>                 Key: CASSANDRA-4234
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-4234
>             Project: Cassandra
>          Issue Type: Improvement
>    Affects Versions: 1.2
>            Reporter: Jonathan Ellis
>            Assignee: Yuki Morishita
>            Priority: Minor
>              Labels: compaction
>             Fix For: 1.2
>
>         Attachments: 4234.txt
>
>
> CASSANDRA-3442 will recompact sstables with high levels of expired 
> tombstones, but only under SCS.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to