On 2017-07-11 20:09 (-0700), "Kevin O'Connor" <ke...@reddit.com.INVALID> wrote: 
> This might be an interesting question - but is there a way to truncate data
> from just a single node or two as a test instead of truncating from the
> entire cluster? We have time series data we don't really care if we're
> missing gaps in, but it's taking up a huge amount of space and we're
> looking to clear some. I'm worried if we run a truncate on this huge CF
> it'll end up locking up the cluster, but I don't care so much if it just
> kills a single node.
> 

IF YOU CAN TOLERATE DATA INCONSISTENCIES, You can stop a node, delete some 
sstables, and start it again. The risk in deleting arbitrary sstables is that 
you may remove a tombstone and bring data back to life, or remove the only 
replica with a write if you write at CL:ONE, but if you're OK with data going 
missing, you won't hurt much as long as you stop cassandra before you go 
killing sstables.

TWCS does make this easier, because you can use sstablemetadata to identify 
timestamps/tombstone %s, and then nuke sstables that are old/mostly-expired 
first.


> Is doing something like deleting SSTables from disk possible? If I alter
> this keyspace from an RF of 2 down to 1 and then delete them, they won't be
> able to be repaired if I'm thinking this through right.
> 

If you drop RF from 2 to 1, you can just run cleanup and delete half the data 
(though it'll rewrite sstables to do it, which will be a short term increase).


---------------------------------------------------------------------
To unsubscribe, e-mail: user-unsubscr...@cassandra.apache.org
For additional commands, e-mail: user-h...@cassandra.apache.org

Reply via email to