Re: [jira] Resolved: (LUCENE-1044) Behavior on hard power shutdown

Mark Miller Sun, 04 Nov 2007 07:52:08 -0800

Even if we cannot guarantee durability, it would be nice if we couldguarantee a consistent index. It sounds like the only problem in amachine with a lying drive is that you could lose a number of committedtransactions. I would much prefer that to a corrupted index. I canalways re-add what was lost much quicker than rebuilding a 5 million docarchive. In either case, I have my choice between the two as long as theindex is guaranteed to be corruption free.


robert engels wrote:

Usually you can configure the drives so that sync() ALWAYS syncs -drive jumpers, driver setup, or other methods. Some drives that arebattery backed and such do not need it.
Without sync() truly being a sync you could never write a databasethat was resilient.
It will exact a heavier toll on performance that you might think. Inorder to do it properly, all filesystem metadata must be sync;d aswell. The biggest difference is that you lose the degree ofmulti-processing that is inherent when sync'ing is disabled - as thedrive (or OS) does the physical write asynchronously while the systemdoes other work - with sync() this is lost.
This is why in a db system, the only file that is sync'd is the logfile - all other files can be made "in sync" from the log file - andthis file is normally striped for optimum write performance. Somesystems have special "log file drives" (some even solid state, orbattery backed ram) to aid the performance.
On Nov 4, 2007, at 8:30 AM, Yonik Seeley wrote:
On 11/4/07, Michael McCandless <[EMAIL PROTECTED]> wrote:
The problem is, on a hard shutdown (kill -9 or JVM/machine crashes),
apparently future operations may have completed while some past
operations have not.  For example, the new segments_N file was
successfully written while say the _X.fdx file of the just-flushed
segment was not successfully written, even though Lucene had written &
closed _X.fdx before segments_N.
That should be impossible except for a machine crash.  Kill -9 or a
JVM crash should have no effect on data already written.

But a sync option would be both simple and useful for people trying to
take live snapshots of an index, or to protect against machine
crashes.  This isn't an absolute 100% guarantee either (so don't test
for it) - the drives often lie to the OS about data being flushed.
It's the best we can do at our level though.
http://www.google.com/search?q=fsync+drive+lies

-Yonik

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]


---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Re: [jira] Resolved: (LUCENE-1044) Behavior on hard power shutdown

Reply via email to