Yes, I never thought of that. Thanks
Jabbar Azam On 12 June 2014 19:45, Jeremy Jongsma <jer...@barchart.com> wrote: > That will not necessarily scale, and I wouldn't recommend it - your > "backup node" will need as much disk space as an entire replica of the > cluster data. For a cluster with a couple of nodes that may be OK, for > dozens of nodes, probably not. You also lose the ability to restore > individual nodes - the only way to replace a dead node is with a full > repair. > > > On Thu, Jun 12, 2014 at 1:38 PM, Jabbar Azam <aja...@gmail.com> wrote: > >> There is another way. You create a cassandra node in it's own datacentre, >> then any changes going to the main cluster will be replicated to this node. >> You can backup from this node. In the event of a disaster the data from >> both clusters and wiped and then replayed to the individual node. The data >> will then be replicated to the main cluster. >> >> This will also work for the case when the main cluster increases or >> decreases in size. >> >> Thanks >> >> Jabbar Azam >> >> >> On 12 June 2014 18:27, Andrew <redmu...@gmail.com> wrote: >> >>> There isn’t a lot of “actual documentation” on the act of backing up, >>> but I did research for my own company into the act of backing up and >>> unfortunately, you’re not going to have a similar setup as Oracle. There >>> are reasons for this, however. >>> >>> If you have more than one replica of the data, that means each node in >>> the cluster will likely be holding it’s own unique set of data. So you >>> would need to back up the ENTIRE set of nodes in order to get an accurate >>> snapshot. Likewise, you would need to restore it to the cluster of the >>> same size in order to restore it (and then run refresh to tell Cassandra to >>> reload the tables from disk). >>> >>> Copying the snapshots is easy—it’s just a bunch of files in your data >>> directory. It’s even smaller if you use incremental snapshots. I’ll >>> admit, I’m no expert on tape drives, but I’d imagine it’s as easy as >>> copy/pasting the snapshots to the drive (or whatever the equivalent tape >>> drive operation is). >>> >>> What you (and I, admittedly) would really like to see is a way to back >>> up all the logical *data*, and then simply replay it. This is possible on >>> Oracle because it’s typically restricted to either one (plus maybe one or >>> two standbys) that don’t “share” any data. What you could do, in theory, >>> is literally select all the data in the entire cluster and simply dump it >>> to a file—but this could take hours, days, or even weeks to complete, >>> depending on the size of your data, and then simply re-load it. This is >>> probably not a great solution, but hey—maybe it will work for you. >>> >>> Netflix (thankfully) has posted a lot of their operational observations >>> and what not, including their utility Priam. In their documentation, they >>> include some overviews of what they use: >>> https://github.com/Netflix/Priam/wiki/Backups >>> >>> Hope this helps! >>> >>> Andrew >>> >>> On June 12, 2014 at 6:18:57 AM, Jack Krupansky (j...@basetechnology.com) >>> wrote: >>> >>> The doc for backing up – and restoring – Cassandra is here: >>> >>> http://www.datastax.com/documentation/cassandra/2.0/cassandra/operations/ops_backup_restore_c.html >>> >>> That doesn’t tell you how to move the “snapshot” to or from tape, but a >>> snapshot is the starting point for backing up Cassandra. >>> >>> -- Jack Krupansky >>> >>> *From:* Camacho, Maria (NSN - FI/Espoo) <maria.cama...@nsn.com> >>> *Sent:* Thursday, June 12, 2014 4:57 AM >>> *To:* user@cassandra.apache.org >>> *Subject:* Backup Cassandra to >>> >>> >>> Hi there, >>> >>> >>> >>> I'm trying to find information/instructions about backing up and >>> restoring a Cassandra DB to and from a tape unit. >>> >>> >>> >>> I was hopping someone in this forum could help me with this since I >>> could not find anything useful in Google :( >>> >>> >>> >>> Thanks in advance, >>> >>> Maria >>> >>> >>> >>> >> >