To be clear: option 1) is fine. Lucene index updates are carefully
sequenced so that the index is never left in a bogus state. All data
files are written and flushed to disk first, and only then are the
segments.* files written that refer to those data files. You can
capture the current set of files with hard links to create a
consistent backup.
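As a minimal sketch (assuming GNU cp and an index living at
collection/data/index; adjust the paths to your layout):

  # create a hard-link snapshot of the index directory (fast, no data copied)
  cp -al collection/data/index backups/index-$(date +%Y%m%d-%H%M%S)

The hard links must be on the same filesystem as the index, so copy the
snapshot somewhere else afterwards if you need an off-machine backup.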
The CheckIndex program will verify the index backup:

java -cp yourcopy/lucene-core-SOMETHING.jar \
    org.apache.lucene.index.CheckIndex collection/data/index
lucene-core-SOMETHING.jar is usually in the solr-webapp directory where
Solr is unpacked.
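If you are not sure where that jar ended up, something like the
following should turn it up (the /opt/solr path here is only an
example, use wherever you unpacked Solr):

  # locate the Lucene core jar under the Solr install
  find /opt/solr -name 'lucene-core-*.jar'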
On 12/20/2012 02:16 AM, Andy D'Arcy Jewell wrote:
Hi all.
Can anyone advise me of a way to pause and resume Solr 4 so I can
perform a backup? I need to be able to revert to a usable (though not
necessarily complete) index after a crash or other "disaster" more
quickly than a re-index operation would yield.
I can't yet afford the "extravagance" of a separate Solr replica just
for backups, and I'm not sure if I'll ever have the luxury. I'm
currently running with just one node, but we are not yet live.
I can think of the following ways to do this, each with various
downsides:
1) Just back up the existing index files whilst indexing continues
+ Easy
+ Fast
- Incomplete
- Potential for corruption? (e.g. partial files)
2) Stop/Start Tomcat
+ Easy
- Very slow, and I/O and CPU intensive
- Client gets errors when trying to connect
3) Block/unblock the Solr port with iptables (see the sketch after this list)
+ Fast
- Client gets errors when trying to connect
- Have to wait for existing transactions to complete (not sure
how, maybe watch socket FD's in /proc)
4) Pause/restart the Solr service
+ Fast ? (hopefully)
- Client gets errors when trying to connect
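Rough sketches of what I mean for 3) and 4) (8983 is the default Solr
port and would differ under Tomcat; the pid file location is just a
guess at how the service might be run):

  # 3) temporarily reject new connections to the Solr port
  iptables -I INPUT -p tcp --dport 8983 -j REJECT
  # ... take the backup ...
  iptables -D INPUT -p tcp --dport 8983 -j REJECT

  # 4) pause and later resume the whole JVM process
  kill -STOP $(cat /var/run/solr.pid)
  # ... take the backup ...
  kill -CONT $(cat /var/run/solr.pid)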
In any event, the web app will have to gracefully handle the
unavailability of Solr, probably by displaying a "down for
maintenance" message, but that window should preferably be very
short.
Can anyone comment on my proposed solutions above, or provide any
additional ones?
Thanks for any input you can provide!
-Andy