Re: [jira] Created: (LUCENE-1328) FileNotFoundException in

Yajun Mon, 07 Jul 2008 11:13:01 -0700

My answer is inline.



Michael McCandless-2 wrote:
> 
> Yajun Liu (JIRA) <[EMAIL PROTECTED]> wrote:
> 
> Can you double check which underlying version of Lucene you are
> using?  Those source file/line numbers don't line up to a stock 2.1
> release as far as I can tell.
> 
> YL>> The lucene library comes with Solr. The jar file is
> lucene-core-2007-05-20_00-04-53.jar. I compared the source code of
> IndexReader which is close to Lucene 2.2, not 2.1
> 
> This part is odd:
> 
>> When this happen, the program automatically tried to reopen the most
>> recent THREE snapshots and we got the same exception for each
>> snapshot. Remember, each of the snapshot was validated before it was
>> copied.
> 
> If I understand the steps correctly, once and index is created,
> verified (by opening an IndexReader) and copied up to a snapshot, that
> snapshot is never changed by Lucene (opened by an IndexWriter, or
> IndexReader that does delete/setNorm)?  Yet somehow all 3 become
> corrupt at the same time (that exception is hit when opening an
> IndexReader)?
> 
> YL>> That's correct. Here is the sudo code of taking a snapshot:
>     create a tmp directory /var/tmp/snapshot_timestamp
>     int rety = 0;
>     while (retry++ < 10) {
>          copy index to /var/tmp/snapshot_timestamp
>          try {
>              IndexReader reader =
> IndexReader.open(/var/tmp/snapshot_timestamp/index);
>              reader.close();
>          } catch (IOException e) {
>              delete /var/tmp/snapshot_timestamp/index
>              continue;
>          }
>      }
> 
> Did you say IndexReader also does delete? If that is the case, then the
> snapshot that is "validated" by IndexReader could be "invalidated" by this
> deletion. That could explain all the snapshots become corrupted.     
>          
>          
> Also:
> 
>> 2) The similar things happened on the search node: the same index
>> which was opened OK during last night nodes recycle could not be
>> opened due to the same exception. The search node does not update
>> index.
> 
> Again nothing makes changes to these copies either, yet they suddenly
> starting hitting that same exception?
> 
> YL>> correct. Nothing change on the search node.
> 
> Are you really sure, when you go to open the snapshots, that you're
> actually specifying the right directory each time?  It seems
> particularly odd for all 3 snapshots to hit the same exception at the
> same time.
> 
> YL>> very sure. 
> 
> Another question: is it possible you are reopening your readers on an
> index dir while rsync is still copying to that index dir?  That could
> easily
> lead to exceptions like this....
> 
> YL>> index was rsync to a /var/tmp/rsync_tmp directory. When rsync is
> done, it is moved to a directory and then search node open indexreader on
> it.
> 
> Mike
> 
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: [EMAIL PROTECTED]
> For additional commands, e-mail: [EMAIL PROTECTED]
> 
> 
> 

-- 
View this message in context: 
http://www.nabble.com/-jira--Created%3A-%28LUCENE-1328%29-FileNotFoundException-in-tp18311277p18323050.html
Sent from the Lucene - Java Developer mailing list archive at Nabble.com.


---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Re: [jira] Created: (LUCENE-1328) FileNotFoundException in

Reply via email to