Re: real time updates

Michael McCandless Sun, 15 Mar 2009 16:21:49 -0700


Marvin Humphrey wrote:

Lucene also has a blip, but it's different because Lucene willstill acceptadded/deleted documents; but, one cannot reopen a new realtime(LUCENE-1516)
reader during the blip.
The consolidator process has to block while carrying forwarddeletes, because
otherwise new deletions may get dropped.
If seg_2 is getting merged away and a new writer adds deletionsagainst seg_2that the consolidator never sees, then once the consolidatorfinishes, thosedeletes will vanish without a trace and the "deleted" document willsuddenly
reappear in the newly consolidated segment.

Right. I guess it's because Lucene buffers up deletes that it cancontinueto accept adds & deletes even during the blip. But it cannot write anew

segment (materialize the adds & deletes) during the blip.

Hang on: does your writer process hold onto the write lock the whole
time it's open? Or it only grabs it when it needs to commit achange?
The consolidator grabs consolidate.lock as soon as it launches.While it'sworking in the background (so to speak), write process continuallygraband release write.lock. At the very end of the consolidationprocess, theconsolidator grabs write.lock so that it can carry forward recentdeletions --
but hopefully that doesn't take very long.


OK.  Does this mean you can run multiple writers against the same index,

to gain concurrency? (Though... that's tricky, with deletes; oh maybebecauseyou store new deletes for an old segment along with the new segmentthat's

OK?  Hmm, it still seems like you'd have a staleness problem).

Unfortuntely, we have an annoying IPC issue to deal with. (Lucenewouldn'thave this problem.) When it's time for the consolidator to grabwrite.lock,it will try to obtain it once per second for X seconds, sleeping inbetween.But if index mods are flying fast and furious, write processes maycontinually
cut in front and the consolidator may have difficulty obtaining the
write.lock.
We'd like to be able to signal the waiting consolidator process whena writeprocess finishes up so that it can try for write.lock right away,but AFAIKthere's no portable way to communicate that from one process toanother.
Probably the only workaround is to add yet another lock file, e.g.
consolidator_is_waiting.lock, that blocks further write processes.Yuck. Wemay also want to have the consolidator try more often than once persecond.



Ugh, lock starvation.  Really the OS should provide a FIFO lock queue of
some sort.

Mike

Re: real time updates

Reply via email to