One typical reason is to protect against human error. 

> On 7.12.2013, at 11.09, Jason Wee <peich...@gmail.com> wrote:
> 
> Hmm... cassandra fundamental key features like fault tolerant, durable and 
> replication. Just out of curiousity, why would you want to do backup?
> 
> /Jason
> 
> 
>> On Sat, Dec 7, 2013 at 3:31 AM, Robert Coli <rc...@eventbrite.com> wrote:
>>> On Fri, Dec 6, 2013 at 6:41 AM, Amalrik Maia <amal...@s1mbi0se.com.br> 
>>> wrote:
>>> hey guys, I'm trying to take backups of a multi-node cassandra and save 
>>> them on S3. 
>>> My idea is simply doing ssh to each server and use nodetool to create the 
>>> snapshots then push then to S3. 
>> 
>> https://github.com/synack/tablesnap
>> 
>>> So is this approach recommended? my concerns are about inconsistencies that 
>>> this approach can lead, since the snapshots are taken one by one and not in 
>>> parallel.  
>>> Should i worry about it or cassandra finds a way to deal with 
>>> inconsistencies when doing a restore?
>> 
>> The backup is as consistent as your cluster is at any given moment, which is 
>> "not necessarily". Manual repair brings you closer to consistency, but only 
>> on data present when the repair started.
>> 
>> =Rob 
> 

Reply via email to