On Mar 16, 2009, at 7:55 PM, Otis Gospodnetic wrote:
Eric,
There are a couple of ways you can back up a Lucene index built by
Solr:
1) have a look at the Solr replication scripts, specifically
snapshooter. This script creates a snapshot of an index. It's
typically triggered by Solr after its "commit" or "optimize" calls,
when the index is "stable" and not being modified. If you use
snapshooter to create index snapshots, you could simply grab a
snapshot and there is your backup.
2) have a look at Solr's new replication mechanism (info on the
Solr Wiki), which does something similar to the above, but without
relying on replication (shell) scripts. It does everything via HTTP.
In my 10 years of using Lucene and N years of using Solr and Nutch
I've never had index corruption. Nowadays Lucene even has
transactions, so it's much harder (theoretically impossible) to
corrupt the index.
Thank you for the information. I happened to read about snapshooter
about 10 minutes after I sent that message, but didn't know about
replication. It inspires confidence that you haven't experienced
index corruption in your years of using this technology.
Eric
--
Eric J. Christeson
<eric.christe...@ndsu.edu>
Enterprise Computing and Infrastructure (701) 231-8693 (Voice)
North Dakota State University