Re: [jira] Commented: (LUCENE-743) IndexReader.reopen()

robert engels Mon, 12 Nov 2007 14:08:48 -0800

Then how can the commit during reopen be an issue?

I am not very family with this new code, but it seems that you needto write segments.XXX.new and then rename to segments.XXX.

As long as the files are sync'd, even on nfs the reopen should notsee segments.XXX until is is ready.

Although lockless commits are beneficial in their own rite, I stillthink that people's understanding of NFS limitations are flawed. Readthe section below on "close to open" consistency. There should be noproblem using Lucene across NFS - even the old version.

The write-once nature of Lucene makes this trivial. The only problemwas the segments file, which is lucene used the read/write lock andclose(0 correctly never would have been a problem.


According to the NFS docs:

NFS Version 2 requires that a server must save all the data in awrite operation to disk before it replies to a client that the writeoperation has completed. This can be expensive because it breakswrite requests into small chunks (8KB or less) that must each bewritten to disk before the next chunk can be written. Disks work bestwhen they can write large amounts of data all at once.

NFS Version 3 introduces the concept of "safe asynchronous writes." AVersion 3 client can specify that the server is allowed to replybefore it has saved the requested data to disk, permitting the serverto gather small NFS write operations into a single efficient diskwrite operation. A Version 3 client can also specify that the datamust be written to disk before the server replies, just like aVersion 2 write. The client specifies the type of write by settingthe stable_how field in the arguments of each write operation toUNSTABLE to request a safe asynchronous write, and FILE_SYNC for anNFS Version 2 style write.

Servers indicate whether the requested data is permanently stored bysetting a corresponding field in the response to each NFS writeoperation. A server can respond to an UNSTABLE write request with anUNSTABLE reply or a FILE_SYNC reply, depending on whether or not therequested data resides on permanent storage yet. An NFS protocol-compliant server must respond to a FILE_SYNC request only with aFILE_SYNC reply.

Clients ensure that data that was written using a safe asynchronouswrite has been written onto permanent storage using a new operationavailable in Version 3 called a COMMIT. Servers do not send aresponse to a COMMIT operation until all data specified in therequest has been written to permanent storage. NFS Version 3 clientsmust protect buffered data that has been written using a safeasynchronous write but not yet committed. If a server reboots beforea client has sent an appropriate COMMIT, the server can reply to theeventual COMMIT request in a way that forces the client to resend theoriginal write operation. Version 3 clients use COMMIT operationswhen flushing safe asynchronous writes to the server during a close(2) or fsync(2) system call, or when encountering memory pressure.




A8. What is close-to-open cache consistency?

A. Perfect cache coherency among disparate NFS clients is veryexpensive to achieve, so NFS settles for something weaker thatsatisfies the requirements of most everyday types of file sharing.Everyday file sharing is most often completely sequential: firstclient A opens a file, writes something to it, then closes it; thenclient B opens the same file, and reads the changes.

So, when an application opens a file stored in NFS, the NFS clientchecks that it still exists on the server, and is permitted to theopener, by sending a GETATTR or ACCESS operation. When theapplication closes the file, the NFS client writes back any pendingchanges to the file so that the next opener can view the changes.This also gives the NFS client an opportunity to report any serverwrite errors to the application via the return code from close().This behavior is referred to as close-to-open cache consistency.

Linux implements close-to-open cache consistency by comparing theresults of a GETATTR operation done just after the file is closed tothe results of a GETATTR operation done when the file is next opened.If the results are the same, the client will assume its data cache isstill valid; otherwise, the cache is purged.

Close-to-open cache consistency was introduced to the Linux NFSclient in 2.4.20. If for some reason you have applications thatdepend on the old behavior, you can disable close-to-open support byusing the "nocto" mount option.

There are still opportunities for a client's data cache to containstale data. The NFS version 3 protocol introduced "weak cacheconsistency" (also known as WCC) which provides a way of checking afile's attributes before and after an operation to allow a client toidentify changes that could have been made by other clients.Unfortunately when a client is using many concurrent operations thatupdate the same file at the same time, it is impossible to tellwhether it was that client's updates or some other client's updatesthat changed the file.

For this reason, some versions of the Linux 2.6 NFS client abandonWCC checking entirely, and simply trust their own data cache. Onthese versions, the client can maintain a cache full of stale filedata if a file is opened for write. In this case, using file lockingis the best way to ensure that all clients see the latest version ofa file's data.

A system administrator can try using the "noac" mount option toachieve attribute cache coherency among multiple clients. Almostevery client operation checks file attribute information. Usually theclient keeps this information cached for a period of time to reducenetwork and server load. When "noac" is in effect, a client's fileattribute cache is disabled, so each operation that needs to check afile's attributes is forced to go back to the server. This permits aclient to see changes to a file very quickly, at the cost of manyextra network operations.

Be careful not to confuse "noac" with "no data caching." The "noac"mount option will keep file attributes up-to-date with the server,but there are still races that may result in data incoherency betweenclient and server. If you need absolute cache coherency amongclients, applications can use file locking, where a client purgesfile data when a file is locked, and flushes changes back to theserver before unlocking a file; or applications can open their fileswith the O_DIRECT flag to disable data caching entirely.

For a better understanding of the compromises faced in the design ofNFS caching, see Callaghan's "NFS Illustrated."




On Nov 12, 2007, at 3:47 PM, Yonik Seeley wrote:

On Nov 12, 2007 4:43 PM, robert engels <[EMAIL PROTECTED]> wrote:

Why doesn't reopen get the 'read' lock, since commit has the write
lock, it should wait...


After lockless commits, there is no read lock!

-Yonik

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Re: [jira] Commented: (LUCENE-743) IndexReader.reopen()

Reply via email to