He explained why in the message. Because it is faster to bring up a new host from a snapshot.
I presume that he doesn't need the full cluster running all the time. wunder On Sep 20, 2012, at 2:19 PM, Markus Jelsma wrote: > Hi, > > Why do you want to back up? With enough machines and a decent replication > factor (3 or higher) there is usually little need to back it up. If you have > the space it's better to launch a second cluster in another DC. > > You can also choose to increase the number of maxCommitsToKeep but it'll take > a lot of space and I/O if you have a frequent auto-commit enabled. Another > options is to keep generating raw Solr XML files, you can easily load > millions of documents in 10-15 minutes. > > Cheers, > > -----Original message----- >> From:jimtronic <jimtro...@gmail.com> >> Sent: Thu 20-Sep-2012 23:04 >> To: solr-user@lucene.apache.org >> Subject: Backup strategy for SolrCloud >> >> I'm trying to determine my options for backing up data from a SolrCloud >> cluster. >> >> For me, bringing up my cluster from scratch can take several hours. It's way >> faster to take snapshots of the index periodically and then use one of these >> when booting a new instance. Since I use static xml files and delta-imports, >> everything catches up on quickly. >> >> Sorry if this is a dumb question, but where do I pull the snapshots from? >> Zookeeper? Any box in the cluster? The leader? >> >> Thanks! >> Jim >>