[ 
https://issues.apache.org/jira/browse/CASSANDRA-4756?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15033128#comment-15033128
 ] 

Juho Mäkinen commented on CASSANDRA-4756:
-----------------------------------------

This seems to be related to my issue CASSANDRA-10757 where cluster compaction 
requires significant time after a bit sstableloader migration. Is there 
anything that can be done to improve this?

> Bulk loading snapshots creates RF^2 copies of the data
> ------------------------------------------------------
>
>                 Key: CASSANDRA-4756
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-4756
>             Project: Cassandra
>          Issue Type: Improvement
>    Affects Versions: 1.2.0 beta 1
>            Reporter: Nick Bailey
>
> Since a cluster snapshot will contain rf copies of each piece of data, 
> bulkloading all of those snapshots will create rf^2 copies of each piece of 
> data.
> Not sure what the solution here is. Ideally we would merge the RF copies of 
> the data before sending to the cluster. This would solve any inconsistencies 
> that existed when the snapshot was taken.
> A more naive approach of only loading one of the RF copies and assuming there 
> are no inconsistencies might be an easier goal for the near term though.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to