I wouldn't recommend doing this before regular backups, for the simple reason that on large data sets it will take a long time to run and will require your node backup schedule to be properly staggered (you should never run repair on all nodes at the same time). Backups should be treated as eventually consistent, just like Cassandra itself.
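
If the heavier steps are dropped for routine backups, what remains of the sequence you listed is flush followed by snapshot. A minimal sketch of that per-node step is below. The keyspace argument, the snapshot tag, the NODETOOL override and the DRY_RUN switch are my own illustrative assumptions, not anything from your setup.

```shell
#!/bin/sh
# Sketch of a routine per-node backup step: flush memtables, then snapshot.
# Assumptions (not from the thread): the keyspace/tag arguments, the
# NODETOOL override and the DRY_RUN switch are illustrative only.

NODETOOL="${NODETOOL:-nodetool}"

# Run a command, or just print it when DRY_RUN=1.
run() {
    if [ "${DRY_RUN:-0}" = "1" ]; then
        echo "$*"
    else
        "$@"
    fi
}

# backup_node KEYSPACE TAG
backup_node() {
    keyspace="$1"
    tag="$2"
    # Write memtables to SSTables so recent writes are on disk.
    run "$NODETOOL" flush "$keyspace" || return 1
    # Take a snapshot of the keyspace under the given tag.
    run "$NODETOOL" snapshot -t "$tag" "$keyspace" || return 1
}
```

To see what it would run without touching the node, set DRY_RUN=1 and call something like `backup_node my_keyspace nightly`; it prints the two nodetool commands instead of executing them.
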
That said, if you are doing a one-time backup of a node and for whatever reason you want it as up-to-date as possible without unnecessary data, you should also run nodetool compact.

On Mon, Jun 2, 2014 at 2:18 PM, ng <[email protected]> wrote:
>
> I need to make sure that all the data in sstable before taking the
> snapshot.
>
> I am thinking of
> nodetool cleanup
> nodetool repair
> nodetool flush
> nodetool snapshot
>
> Am I missing anything else?
>
> Thanks in advance for the responses/suggestions.
>
> ng
>
