[ 
https://issues.apache.org/jira/browse/CASSANDRA-5722?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13706423#comment-13706423
 ] 

Tyler Hobbs commented on CASSANDRA-5722:
----------------------------------------

I wasn't aware of CASSANDRA-2524, so I'll look into that and see if there's any 
overlap.  Thinking about vnodes is actually why I suggested that technique 
instead of simply checking first/last keys for the sstable.

It was easy to put together a quick first version (not polished); you can view 
a diff here: 
https://github.com/thobbs/cassandra/compare/apache:trunk...CASSANDRA-5722.  
Even with vnodes, I think it's a win.
                
> Cleanup should skip sstables that don't contain data outside a nodes ranges
> ---------------------------------------------------------------------------
>
>                 Key: CASSANDRA-5722
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-5722
>             Project: Cassandra
>          Issue Type: Improvement
>            Reporter: Nick Bailey
>            Assignee: Tyler Hobbs
>             Fix For: 2.0.1
>
>
> Right now cleanup is optimized to simply delete sstables that *only* contain 
> data that doesn't belong on the node, for all other sstables though, it will 
> read them, check each row, and write out new sstables.
> Cleanup could be optimized to look at an sstable and determine that all data 
> within the sstable does belong on a node, and therefore skip re-writing that 
> sstable. This would make cleanup essentially a noop in the case where all 
> data on a node belongs on that node.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to