Thank you for your reply.

This isn't happening in a single environment. Our application is being used
by various clients and this has been reported by multiple users - all of
whom were running the earlier pylucene (v4.10) - without issues.

One thing to mention is that our earlier version used Python 2.7.15 (with
pylucene 4.10) and now we are using Python 3.8.10 with Pylucene 6.5.0 - the
indexing logic is the same...

One other thing to note is that the issue described has (so far!) only
occurred on MS Windows - none of our Linux customers have complained about
this.

Any ideas?

Regards,
Antony

On Thu, 28 Apr 2022 at 17:00, Adrien Grand <jpou...@gmail.com> wrote:

> Hi Anthony,
>
> This isn't something that you should try to fix programmatically,
> corruptions indicate that something is wrong with the environment,
> like a broken disk or corrupt RAM. I would suggest running a memtest
> to check your RAM and looking at system logs in case they have
> anything to tell about your disks.
>
> Can you also share the full stack trace of the exception?
>
> On Thu, Apr 28, 2022 at 10:26 AM Antony Joseph
> <antony.dev.webm...@gmail.com> wrote:
> >
> > Hello,
> >
> > We are facing a strange situation in our application as described below:
> >
> > *Using*:
> >
> >    - Python 3.8.10
> >    - Pylucene 6.5.0
> >    - Java 8 (1.8.0_181)
> >    - Runs on Linux and Windows (error seen on Windows)
> >
> > We suddenly get the following *error*:
> >
> > 2022-02-10 09:58:09.253215: ERROR : writer | Failed to get index
> > (D:\i\202202) writer, Exception:
> > org.apache.lucene.index.CorruptIndexException: Unexpected file read error
> > while reading index.
> >
> (resource=BufferedChecksumIndexInput(MMapIndexInput(path="D:\i\202202\segments_fo")))
> >
> >
> > After this, no further indexing happens - trying to open the index for
> > writing throws the above error - and the index writer does not open.
> >
> > FYI, our code contains the following *settings*:
> >
> > index_path = "D:\i\202202"
> > index_directory = FSDirectory.open(Paths.get(index_path))
> > iconfig = IndexWriterConfig(wrapper_analyzer)
> > iconfig.setOpenMode(IndexWriterConfig.OpenMode.CREATE_OR_APPEND)
> > iconfig.setRAMBufferSizeMB(16.0)
> > writer = IndexWriter(index_directory, iconfig)
> >
> >
> > *Repairing*
> > We tried 'repairing' the index with the following command / tool:
> >
> > java -cp lucene-core-6.5.0.jar:lucene-backward-codecs-6.5.0.jar
> > org.apache.lucene.index.CheckIndex "D:\i\202202" -exorcise
> >
> > This however returns saying "No problems found with the index."
> >
> >
> > *Work around*
> > We have to manually delete the problematic segment file:
> > D:\i\202202\segments_fo
> > after which the application starts again... until the next corruption. We
> > can't spot a specific pattern.
> >
> >
> > *Two questions:*
> >
> >    1. Can we handle this situation programmatically, so that no manual
> >    intervention is needed?
> >    2. Any reason why we are facing the corruption issue in the first
> place?
> >
> >
> > Before this we were using Pylucene 4.10 and we didn't face this problem -
> > the application logic is the same.
> >
> > Also, while the application runs on both Linux and Windows, so far we
> have
> > observed this situation only on various Windows platforms.
> >
> > Would really appreciate some assistance. Thanks in advance.
> >
> > Regards,
> > Antony
>
>
>
> --
> Adrien
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
> For additional commands, e-mail: java-user-h...@lucene.apache.org
>
>

Reply via email to