Nick Bailey created CASSANDRA-7779:
--------------------------------------

             Summary: Add option to sstableloader to only stream to the local dc
                 Key: CASSANDRA-7779
                 URL: https://issues.apache.org/jira/browse/CASSANDRA-7779
             Project: Cassandra
          Issue Type: Improvement
          Components: Tools
            Reporter: Nick Bailey
             Fix For: 1.2.19, 2.0.10, 2.1.1


This is meant to be a potential workaround for CASSANDRA-4756. Due to that 
ticket, trying to load a cluster wide snapshot via sstableloader will 
potentially stream an enormous amount of data. In a 3 datacenter cluster with 
rf=3 in each datacenter, 81 copies of the data would be streamed. Once we have 
per range sstables we can optimize sstableloader to merge data and only stream 
one copy, but until then we need a workaround. By only streaming to the local 
datacenter we can load the data locally in each datacenter and only have 9 
copies of the data rather than 81.

This could potentially be achieved by the option to ignore certain nodes that 
already exists in sstableloader, but in the case of vnodes and topology changes 
in the cluster, this could require specifying every node in the cluster as 
'ignored' on the command line which could be problematic. This is just a 
shortcut to avoid that.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to