[ 
https://issues.apache.org/jira/browse/CASSANDRA-5366?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13607797#comment-13607797
 ] 

Brooke Bryan commented on CASSANDRA-5366:
-----------------------------------------

The problem we have is the amount of data on the node. A few large compactions 
can run before the upgrade process gets around to a specific CF or SSTable, 
which means we end up re-processing some pretty large files (a few hundred GB) 
that compaction has already rewritten. An upgradesstables run currently takes 
around 3 or 4 days to complete, and the main cost is re-processing these large 
sstables.
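
As a rough illustration of the check being asked for, here is a minimal Python
sketch that picks out only the sstables whose on-disk version differs from the
current one. It is not how Cassandra or nodetool implements anything. It
assumes data file names of the form <keyspace>-<cf>-<version>-<generation>-Data.db,
and the helper names (sstable_version, needs_upgrade) and the version strings
in the usage note are hypothetical.

```python
import os


def sstable_version(filename):
    """Return the version component of a Data.db file name, or None.

    Assumes the layout <keyspace>-<cf>-<version>-<generation>-Data.db.
    This layout is an assumption of the sketch, not taken from the ticket.
    """
    base = os.path.basename(filename)
    suffix = "-Data.db"
    if not base.endswith(suffix):
        return None
    parts = base[:-len(suffix)].split("-")
    # The last part must be the numeric generation; the one before it
    # is treated as the sstable format version.
    if len(parts) < 3 or not parts[-1].isdigit():
        return None
    return parts[-2]


def needs_upgrade(filenames, current_version):
    """Keep only the data files that are not already on current_version."""
    selected = []
    for name in filenames:
        version = sstable_version(name)
        if version is not None and version != current_version:
            selected.append(name)
    return selected
```

With a hypothetical current version of "ib", a file such as ks-cf-hf-12-Data.db
would be selected and ks-cf-ib-13-Data.db would be skipped, so a large sstable
that compaction has already rewritten would not be processed a second time.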
                
> UpgradeSSTables Optimisation
> ----------------------------
>
>                 Key: CASSANDRA-5366
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-5366
>             Project: Cassandra
>          Issue Type: Improvement
>            Reporter: Brooke Bryan
>
> Currently, if you run upgradesstables, Cassandra will run through every 
> single SSTable within the scope of the request.  Where we have some large 
> tables, an upgrade of a single sstable can take hours, even if it is already 
> on the current version.
> After upgrading to a new Cassandra version, it would be ideal to be able to 
> upgrade only the sstables that are not on the latest version. As it stands, 
> the process does a massive amount of disk IO on those sstables, with nothing 
> achieved at the end of it.
> Maybe it's worth adding an option to the nodetool command, or creating a 
> new command for this type of upgrade.

