Nick Bailey created CASSANDRA-4756:
--------------------------------------
Summary: Bulk loading snapshots creates RF^2 copies of the data
Key: CASSANDRA-4756
URL: https://issues.apache.org/jira/browse/CASSANDRA-4756
Project: Cassandra
Issue Type: Improvement
Affects Versions: 1.2.0 beta 1
Reporter: Nick Bailey
Since a cluster snapshot will contain rf copies of each piece of data,
bulkloading all of those snapshots will create rf^2 copies of each piece of
data.
Not sure what the solution here is. Ideally we would merge the RF copies of the
data before sending to the cluster. This would solve any inconsistencies that
existed when the snapshot was taken.
A more naive approach of only loading one of the RF copies and assuming there
are no inconsistencies might be an easier goal for the near term though.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira