On Mar 16, 2009, at 7:55 PM, Otis Gospodnetic wrote:


Eric,

There are a couple of ways you can back up a Lucene index built by Solr:

1) have a look at the Solr replication scripts, specifically snapshooter. This script creates a snapshot of an index. It's typically triggered by Solr after its "commit" or "optimize" calls, when the index is "stable" and not being modified. If you use snapshooter to create index snapshots, you could simply grab a snapshot and there is your backup.

2) have a look at Solr's new replication mechanism (info on the Solr Wiki), which does something similar to the above, but without relying on replication (shell) scripts. It does everything via HTTP.

In my 10 years of using Lucene and N years of using Solr and Nutch I've never had index corruption. Nowadays Lucene even has transactions, so it's much harder (theoretically impossible) to corrupt the index.

Thank you for the information. I happened to read about snapshooter about 10 minutes after I sent that message, but didn't know about replication. It inspires confidence that you haven't experienced index corruption in your years of using this technology.

Eric

--
Eric J. Christeson <eric.christe...@ndsu.edu>
Enterprise Computing and Infrastructure    (701) 231-8693 (Voice)
North Dakota State University

Reply via email to