Hi Michael,

Any update?

Regards,
Antony

On Sun, 1 May 2022 at 19:35, Antony Joseph <antony.dev.webm...@gmail.com> wrote:

> Hi Michael,
>
> Thank you for your reply. Please find responses to your questions below.
>
> Regards,
> Antony
>
> On Sat, 30 Apr 2022 at 18:59, Michael McCandless <luc...@mikemccandless.com> wrote:
>
>> Hi Antony,
>>
>> Hmm it looks like the root cause is this:
>>
>>       Caused by: java.nio.file.NoSuchFileException: D:\i\202204\_14gb.si
>>
>> Can you list all the files in the index directory at the time this
>> exception happens, and reply here?  We need to figure out whether the file
>> is really missing or what.
>>
> Below is the index directory file listing. Yes, the file is missing
> (D:\i\202204\_14gb.si).
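
A directory listing is most useful when it is captured at the moment the exception is raised, before anything else touches the index. The following is a minimal sketch of that idea, using only the Python standard library. The helper names `snapshot_index_dir` and `open_with_snapshot` are hypothetical and not part of PyLucene, and the sketch assumes the application can wrap its writer construction in a callable.

```python
import os
from datetime import datetime, timezone


def snapshot_index_dir(index_path):
    """Return (name, size_bytes, mtime_iso) for every entry in index_path, sorted by name."""
    rows = []
    with os.scandir(index_path) as entries:
        for entry in entries:
            info = entry.stat()
            mtime = datetime.fromtimestamp(info.st_mtime, tz=timezone.utc).isoformat()
            rows.append((entry.name, info.st_size, mtime))
    return sorted(rows)


def open_with_snapshot(open_writer, index_path, log=print):
    """Call open_writer(); if it raises, log the directory listing and re-raise.

    open_writer is any zero-argument callable (hypothetical hook), for example
    a lambda that constructs the IndexWriter.
    """
    try:
        return open_writer()
    except Exception:
        for name, size, mtime in snapshot_index_dir(index_path):
            log(f"{name}\t{size}\t{mtime}")
        raise
```

With the settings quoted further down in this thread, the call might look like `writer = open_with_snapshot(lambda: IndexWriter(index_directory, iconfig), index_path)`. The exception is re-raised unchanged, so existing error handling keeps working.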
>
>>
>> Do you run any virus scanner / disk file tree utilities / etc.?  In the
>> distant past, such programs sometimes caused strange transient errors on
>> Windows if they opened a file for exclusive read access.
>>
> There is no virus scanner running.
>
>>
>> What is the actual drive you are storing the index on (D:)?  Is it a
>> local disk or remote SMBFS mount?
>>
> It's a local disk (D:).
>
>>
>> Mike McCandless
>>
>> http://blog.mikemccandless.com
>>
>>
>> On Sat, Apr 30, 2022 at 8:39 AM Antony Joseph <antony.dev.webm...@gmail.com> wrote:
>>
>>> Thank you for your reply.
>>>
>>> *The full stack trace is included:*
>>>
>>> <super: <class 'JavaError'>, <JavaError object>>
>>>     Java stacktrace:
>>> org.apache.lucene.index.CorruptIndexException: Unexpected file read error while reading index. (resource=BufferedChecksumIndexInput(MMapIndexInput(path="D:\i\202204\segments_10fj")))
>>>         at org.apache.lucene.index.SegmentInfos.readCommit(SegmentInfos.java:290)
>>>         at org.apache.lucene.index.IndexFileDeleter.<init>(IndexFileDeleter.java:165)
>>>         at org.apache.lucene.index.IndexWriter.<init>(IndexWriter.java:972)
>>> Caused by: java.nio.file.NoSuchFileException: D:\i\202204\_14gb.si
>>>         at sun.nio.fs.WindowsException.translateToIOException(Unknown Source)
>>>         at sun.nio.fs.WindowsException.rethrowAsIOException(Unknown Source)
>>>         at sun.nio.fs.WindowsException.rethrowAsIOException(Unknown Source)
>>>         at sun.nio.fs.WindowsFileSystemProvider.newFileChannel(Unknown Source)
>>>         at java.nio.channels.FileChannel.open(Unknown Source)
>>>         at java.nio.channels.FileChannel.open(Unknown Source)
>>>         at org.apache.lucene.store.MMapDirectory.openInput(MMapDirectory.java:238)
>>>         at org.apache.lucene.store.Directory.openChecksumInput(Directory.java:137)
>>>         at org.apache.lucene.codecs.lucene62.Lucene62SegmentInfoFormat.read(Lucene62SegmentInfoFormat.java:89)
>>>         at org.apache.lucene.index.SegmentInfos.readCommit(SegmentInfos.java:357)
>>>         at org.apache.lucene.index.SegmentInfos.readCommit(SegmentInfos.java:288)
>>>         ... 2 more
>>>
>>> Traceback (most recent call last):
>>>   File "index.py", line 112, in start
>>>     writer = IndexWriter(index_directory, iconfig)
>>> lucene.JavaError: <super: <class 'JavaError'>, <JavaError object>>
>>>     Java stacktrace:
>>> org.apache.lucene.index.CorruptIndexException: Unexpected file read error while reading index. (resource=BufferedChecksumIndexInput(MMapIndexInput(path="D:\i\202204\segments_10fj")))
>>>         at org.apache.lucene.index.SegmentInfos.readCommit(SegmentInfos.java:290)
>>>         at org.apache.lucene.index.IndexFileDeleter.<init>(IndexFileDeleter.java:165)
>>>         at org.apache.lucene.index.IndexWriter.<init>(IndexWriter.java:972)
>>> Caused by: java.nio.file.NoSuchFileException: D:\i\202204\_14gb.si
>>>         at sun.nio.fs.WindowsException.translateToIOException(Unknown Source)
>>>         at sun.nio.fs.WindowsException.rethrowAsIOException(Unknown Source)
>>>         at sun.nio.fs.WindowsException.rethrowAsIOException(Unknown Source)
>>>         at sun.nio.fs.WindowsFileSystemProvider.newFileChannel(Unknown Source)
>>>         at java.nio.channels.FileChannel.open(Unknown Source)
>>>         at java.nio.channels.FileChannel.open(Unknown Source)
>>>         at org.apache.lucene.store.MMapDirectory.openInput(MMapDirectory.java:238)
>>>         at org.apache.lucene.store.Directory.openChecksumInput(Directory.java:137)
>>>         at org.apache.lucene.codecs.lucene62.Lucene62SegmentInfoFormat.read(Lucene62SegmentInfoFormat.java:89)
>>>         at org.apache.lucene.index.SegmentInfos.readCommit(SegmentInfos.java:357)
>>>         at org.apache.lucene.index.SegmentInfos.readCommit(SegmentInfos.java:288)
>>>         ... 2 more
>>>
>>>
>>> Regards,
>>> Antony
>>>
>>> On Sat, 30 Apr 2022 at 10:59, Robert Muir <rcm...@gmail.com> wrote:
>>>
>>> > The most helpful thing would be the full stacktrace of the exception.
>>> > This exception should be chaining the original exception and call
>>> > site, and maybe tell us more about this error you hit.
>>> >
>>> > To me, it looks like a Windows-specific issue where the filesystem is
>>> > returning an unexpected error. So it would be helpful to see exactly
>>> > which one that is, and the full trace of where it comes from, to chase
>>> > it further.
>>> >
>>> > On Thu, Apr 28, 2022 at 12:10 PM Antony Joseph
>>> > <antony.dev.webm...@gmail.com> wrote:
>>> > >
>>> > > Thank you for your reply.
>>> > >
>>> > > This isn't happening in a single environment. Our application is being
>>> > > used by various clients and this has been reported by multiple users -
>>> > > all of whom were running the earlier pylucene (v4.10) - without issues.
>>> > >
>>> > > One thing to mention is that our earlier version used Python 2.7.15
>>> > > (with pylucene 4.10) and now we are using Python 3.8.10 with Pylucene
>>> > > 6.5.0 - the indexing logic is the same...
>>> > >
>>> > > One other thing to note is that the issue described has (so far!) only
>>> > > occurred on MS Windows - none of our Linux customers have complained
>>> > > about this.
>>> > >
>>> > > Any ideas?
>>> > >
>>> > > Regards,
>>> > > Antony
>>> > >
>>> > > On Thu, 28 Apr 2022 at 17:00, Adrien Grand <jpou...@gmail.com> wrote:
>>> > >
>>> > > > Hi Antony,
>>> > > >
>>> > > > This isn't something that you should try to fix programmatically:
>>> > > > corruptions indicate that something is wrong with the environment,
>>> > > > like a broken disk or corrupt RAM. I would suggest running a memtest
>>> > > > to check your RAM and looking at system logs in case they have
>>> > > > anything to tell about your disks.
>>> > > >
>>> > > > Can you also share the full stack trace of the exception?
>>> > > >
>>> > > > On Thu, Apr 28, 2022 at 10:26 AM Antony Joseph
>>> > > > <antony.dev.webm...@gmail.com> wrote:
>>> > > > >
>>> > > > > Hello,
>>> > > > >
>>> > > > > We are facing a strange situation in our application as described
>>> > > > > below:
>>> > > > >
>>> > > > > *Using*:
>>> > > > >
>>> > > > >    - Python 3.8.10
>>> > > > >    - Pylucene 6.5.0
>>> > > > >    - Java 8 (1.8.0_181)
>>> > > > >    - Runs on Linux and Windows (error seen on Windows)
>>> > > > >
>>> > > > > We suddenly get the following *error*:
>>> > > > >
>>> > > > > 2022-02-10 09:58:09.253215: ERROR : writer | Failed to get index
>>> > > > > (D:\i\202202) writer, Exception:
>>> > > > > org.apache.lucene.index.CorruptIndexException: Unexpected file read
>>> > > > > error while reading index.
>>> > > > > (resource=BufferedChecksumIndexInput(MMapIndexInput(path="D:\i\202202\segments_fo")))
>>> > > > >
>>> > > > >
>>> > > > > After this, no further indexing happens - trying to open the index
>>> > > > > for writing throws the above error - and the index writer does not
>>> > > > > open.
>>> > > > >
>>> > > > > FYI, our code contains the following *settings*:
>>> > > > >
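>>> > > > > from java.nio.file import Paths
>>> > > > > from org.apache.lucene.index import IndexWriter, IndexWriterConfig
>>> > > > > from org.apache.lucene.store import FSDirectory
>>> > > > >
>>> > > > > # Raw string: in a plain literal, "\202" is parsed as an octal escape.
>>> > > > > index_path = r"D:\i\202202"
>>> > > > > index_directory = FSDirectory.open(Paths.get(index_path))
>>> > > > > # wrapper_analyzer: defined elsewhere (not shown in this excerpt)
>>> > > > > iconfig = IndexWriterConfig(wrapper_analyzer)
>>> > > > > iconfig.setOpenMode(IndexWriterConfig.OpenMode.CREATE_OR_APPEND)
>>> > > > > iconfig.setRAMBufferSizeMB(16.0)
>>> > > > > writer = IndexWriter(index_directory, iconfig)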
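
A side note on the path literal in the settings above: if a Windows path such as D:\i\202202 is written as a plain (non-raw) Python string, the sequence `\202` is read as an octal escape and collapses into a single character. The thread does not say whether the real application builds its path this way, so this may be unrelated to the reported error. The short sketch below only shows the difference between the two literal forms; `describe` is a hypothetical helper.

```python
# Tail of the path as a plain literal vs. a raw literal.
plain_tail = "\202202"   # "\202" becomes one character (U+0082), followed by "202"
raw_tail = r"\202202"    # backslash plus six digits, as intended


def describe(text):
    """Return the code points of text, to make hidden escapes visible."""
    return [f"U+{ord(ch):04X}" for ch in text]


print(len(plain_tail), describe(plain_tail))
print(len(raw_tail), describe(raw_tail))
```

Using a raw string (`r"..."`), doubled backslashes, or forward slashes avoids the problem.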
>>> > > > >
>>> > > > >
>>> > > > > *Repairing*
>>> > > > > We tried 'repairing' the index with the following command / tool:
>>> > > > >
>>> > > > > java -cp lucene-core-6.5.0.jar;lucene-backward-codecs-6.5.0.jar
>>> > > > > org.apache.lucene.index.CheckIndex "D:\i\202202" -exorcise
>>> > > > >
>>> > > > > This however returns saying "No problems found with the index."
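
Two details of the CheckIndex invocation above are easy to get wrong. The Java classpath separator is `;` on Windows and `:` on Linux, and CheckIndex's own usage text warns that `-exorcise` can permanently remove documents, so a read-only run is the safer first step. The following is a minimal sketch that builds the command portably from Python. `build_checkindex_command` is a hypothetical helper, and the jar names are taken from the command quoted above.

```python
import os


def build_checkindex_command(jars, index_path, exorcise=False, java="java"):
    """Build an argument list for running Lucene's CheckIndex tool.

    os.pathsep is ';' on Windows and ':' on POSIX, which matches the
    separator the JVM expects in -cp on each platform.
    """
    classpath = os.pathsep.join(jars)
    cmd = [java, "-cp", classpath, "org.apache.lucene.index.CheckIndex", index_path]
    if exorcise:
        # Destructive: drops segments that fail the check. Off by default.
        cmd.append("-exorcise")
    return cmd


cmd = build_checkindex_command(
    ["lucene-core-6.5.0.jar", "lucene-backward-codecs-6.5.0.jar"],
    r"D:\i\202202",
)
print(cmd)
```

The resulting list can be passed to `subprocess.run(cmd)`. Passing a list rather than a single string avoids shell quoting issues with backslashes in the index path.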
>>> > > > >
>>> > > > >
>>> > > > > *Work around*
>>> > > > > We have to manually delete the problematic segment file:
>>> > > > > D:\i\202202\segments_fo
>>> > > > > after which the application starts again... until the next
>>> > > > > corruption. We can't spot a specific pattern.
>>> > > > >
>>> > > > >
>>> > > > > *Two questions:*
>>> > > > >
>>> > > > >    1. Can we handle this situation programmatically, so that no
>>> > > > >    manual intervention is needed?
>>> > > > >    2. Any reason why we are facing the corruption issue in the first
>>> > > > >    place?
>>> > > > >
>>> > > > >
>>> > > > > Before this we were using Pylucene 4.10 and we didn't face this
>>> > > > > problem - the application logic is the same.
>>> > > > >
>>> > > > > Also, while the application runs on both Linux and Windows, so far we
>>> > > > > have observed this situation only on various Windows platforms.
>>> > > > >
>>> > > > > Would really appreciate some assistance. Thanks in advance.
>>> > > > >
>>> > > > > Regards,
>>> > > > > Antony
>>> > > >
>>> > > >
>>> > > >
>>> > > > --
>>> > > > Adrien
>>> > > >
>>> > > >
>>> > > > ---------------------------------------------------------------------
>>> > > > To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
>>> > > > For additional commands, e-mail: java-user-h...@lucene.apache.org
>>> > > >
>>> > > >
>>> >
>>> >
>>> >
>>>
>>
