ndra.apache.org"
Subject: Re: sstableloader: How much does it actually need?
Message from External Sender
Yes you will have all the data in two nodes provided there is no mutation drop
at node level or data is repaired
For example if you data A,B,C and D. with RF=3 and 4 nodes (node1, nod
Yes you will have all the data in two nodes provided there is no mutation
drop at node level or data is repaired
For example if you data A,B,C and D. with RF=3 and 4 nodes (node1, node2,
node3 and node4)
Data A is in node1, node2 and node3
Data B is in node2, node3, and node4
Data C is in node3,
Been thinking about it, and I can't really see how with 4 nodes and RF=3,
any 2 nodes would *not* have all the data; but am more than willing to
learn.
On the other thing: that's an attractive option, but in our case, the
target cluster will likely come into use before the source-cluster data is
a
>
> Another option is the DSE-bulk loader but it will require to convert to
> csv/json (good option if you don't like to play with sstableloader and deal
> to get all the sstables from all the nodes)
> https://docs.datastax.com/en/dsbulk/doc/index.html
>
Thanks, Sergio. The DataStax Bulk Loader wa
Another option is to use the Spark migrator, it reads a source CQL cluster and
writes to another. It has a validation stage that compares a full scan
and reports the diff:
https://github.com/scylladb/scylla-migrator
There are many more ways to clone a cluster. My main recommendation is
to 'optimiz
Another option is the DSE-bulk loader but it will require to convert to
csv/json (good option if you don't like to play with sstableloader and deal
to get all the sstables from all the nodes)
https://docs.datastax.com/en/dsbulk/doc/index.html
Cheers
Sergio
Il giorno mer 5 feb 2020 alle ore 16:56
Unfortunately, there isn't a guarantee that 2 nodes alone will have the
full copy of data. I'd rather not say "it depends". 😁
TIP: If the nodes in the target cluster have identical tokens allocated,
you can just do a straight copy of the sstables node-for-node then do nodetool
refresh. If the targ
Scenario: Cassandra 3.11.x, 4 nodes, RF=3; moving to identically-sized
cluster via snapshots and sstableloader.
As far as I can tell, in the topology given above, any 2 nodes contain all
of the data. In terms of migrating this cluster, would there be any
downsides or risks with snapshotting and lo