Hi Michael,

Thank you for your reply. Please find responses to your questions below.

Regards,
Antony

On Sat, 30 Apr 2022 at 18:59, Michael McCandless <luc...@mikemccandless.com>
wrote:

> Hi Antony,
>
> Hmm it looks like the root cause is this:
>
>       Caused by: java.nio.file.NoSuchFileException: D:\i\202204\_14gb.si
>
> Can you list all the files in the index directory at the time this
> exception happens, and reply here?  We need to figure out whether the file
> is really missing or what.
>
Below the index directory file listing. Yes, file is missing (D:\i\202204\_
14gb.si)

>
> Do you run any virus scanner / disk file tree utilities / etc.?  In the
> distant past sometimes such programs might cause strange transient errors
> if they open a file for read exclusively or so, on windows.
>
There is no virus scanner running.

>
> What is the actual drive you are storing the index on (D:)?  Is it a local
> disk or remote SMBFS mount?
>
It's a local disk (D:).

>
> Mike McCandless
>
> http://blog.mikemccandless.com
>
>
> On Sat, Apr 30, 2022 at 8:39 AM Antony Joseph <
> antony.dev.webm...@gmail.com> wrote:
>
>> Thank you for your reply.
>>
>> *The full stack trace is included:*
>>
>> <super: <class 'JavaError'>, <JavaError object>>
>>     Java stacktrace:
>> org.apache.lucene.index.CorruptIndexException: Unexpected file read error
>> while
>> reading index.
>>
>> (resource=BufferedChecksumIndexInput(MMapIndexInput(path="D:\i\202204\segments_10fj")))
>>         at
>> org.apache.lucene.index.SegmentInfos.readCommit(SegmentInfos.java:290)
>>         at
>> org.apache.lucene.index.IndexFileDeleter.<init>(IndexFileDeleter.java:165)
>>         at
>> org.apache.lucene.index.IndexWriter.<init>(IndexWriter.java:972)
>> Caused by: java.nio.file.NoSuchFileException: D:\i\202204\_14gb.si
>>         at sun.nio.fs.WindowsException.translateToIOException(Unknown
>> Source)
>>         at sun.nio.fs.WindowsException.rethrowAsIOException(Unknown
>> Source)
>>         at sun.nio.fs.WindowsException.rethrowAsIOException(Unknown
>> Source)
>>         at sun.nio.fs.WindowsFileSystemProvider.newFileChannel(Unknown
>> Source)
>>         at java.nio.channels.FileChannel.open(Unknown Source)
>>         at java.nio.channels.FileChannel.open(Unknown Source)
>>         at
>> org.apache.lucene.store.MMapDirectory.openInput(MMapDirectory.java:238)
>>         at
>> org.apache.lucene.store.Directory.openChecksumInput(Directory.java:137)
>>         at
>>
>> org.apache.lucene.codecs.lucene62.Lucene62SegmentInfoFormat.read(Lucene62SegmentInfoFormat.java:89)
>>         at
>> org.apache.lucene.index.SegmentInfos.readCommit(SegmentInfos.java:357)
>>         at
>> org.apache.lucene.index.SegmentInfos.readCommit(SegmentInfos.java:288)
>>         ... 2 more
>>
>> Traceback (most recent call last):
>>   File "index.py", line 112, in start
>>     writer = IndexWriter(index_directory, iconfig)
>> lucene.JavaError: <super: <class 'JavaError'>, <JavaError object>>
>>     Java stacktrace:
>> org.apache.lucene.index.CorruptIndexException: Unexpected file read error
>> while
>> reading index.
>>
>> (resource=BufferedChecksumIndexInput(MMapIndexInput(path="D:\i\202204\segments_10fj")))
>>         at
>> org.apache.lucene.index.SegmentInfos.readCommit(SegmentInfos.java:290)
>>         at
>> org.apache.lucene.index.IndexFileDeleter.<init>(IndexFileDeleter.java:165)
>>         at
>> org.apache.lucene.index.IndexWriter.<init>(IndexWriter.java:972)
>> Caused by: java.nio.file.NoSuchFileException: D:\i\202204\_14gb.si
>>         at sun.nio.fs.WindowsException.translateToIOException(Unknown
>> Source)
>>         at sun.nio.fs.WindowsException.rethrowAsIOException(Unknown
>> Source)
>>         at sun.nio.fs.WindowsException.rethrowAsIOException(Unknown
>> Source)
>>         at sun.nio.fs.WindowsFileSystemProvider.newFileChannel(Unknown
>> Source)
>>         at java.nio.channels.FileChannel.open(Unknown Source)
>>         at java.nio.channels.FileChannel.open(Unknown Source)
>>         at
>> org.apache.lucene.store.MMapDirectory.openInput(MMapDirectory.java:238)
>>         at
>> org.apache.lucene.store.Directory.openChecksumInput(Directory.java:137)
>>         at
>>
>> org.apache.lucene.codecs.lucene62.Lucene62SegmentInfoFormat.read(Lucene62SegmentInfoFormat.java:89)
>>         at
>> org.apache.lucene.index.SegmentInfos.readCommit(SegmentInfos.java:357)
>>         at
>> org.apache.lucene.index.SegmentInfos.readCommit(SegmentInfos.java:288)
>>         ... 2 more
>>
>>
>> Regards,
>> Antony
>>
>> On Sat, 30 Apr 2022 at 10:59, Robert Muir <rcm...@gmail.com> wrote:
>>
>> > The most helpful thing would be the full stacktrace of the exception.
>> > This exception should be chaining the original exception and call
>> > site, and maybe tell us more about this error you hit.
>> >
>> > To me, it looks like a windows-specific issue where the filesystem is
>> > returning an unexpected error. So it would be helpful to see exactly
>> > which one that is, and the full trace of where it comes from, to chase
>> > it further
>> >
>> > On Thu, Apr 28, 2022 at 12:10 PM Antony Joseph
>> > <antony.dev.webm...@gmail.com> wrote:
>> > >
>> > > Thank you for your reply.
>> > >
>> > > This isn't happening in a single environment. Our application is being
>> > used
>> > > by various clients and this has been reported by multiple users - all
>> of
>> > > whom were running the earlier pylucene (v4.10) - without issues.
>> > >
>> > > One thing to mention is that our earlier version used Python 2.7.15
>> (with
>> > > pylucene 4.10) and now we are using Python 3.8.10 with Pylucene 6.5.0
>> -
>> > the
>> > > indexing logic is the same...
>> > >
>> > > One other thing to note is that the issue described has (so far!) only
>> > > occurred on MS Windows - none of our Linux customers have complained
>> > about
>> > > this.
>> > >
>> > > Any ideas?
>> > >
>> > > Regards,
>> > > Antony
>> > >
>> > > On Thu, 28 Apr 2022 at 17:00, Adrien Grand <jpou...@gmail.com> wrote:
>> > >
>> > > > Hi Anthony,
>> > > >
>> > > > This isn't something that you should try to fix programmatically,
>> > > > corruptions indicate that something is wrong with the environment,
>> > > > like a broken disk or corrupt RAM. I would suggest running a memtest
>> > > > to check your RAM and looking at system logs in case they have
>> > > > anything to tell about your disks.
>> > > >
>> > > > Can you also share the full stack trace of the exception?
>> > > >
>> > > > On Thu, Apr 28, 2022 at 10:26 AM Antony Joseph
>> > > > <antony.dev.webm...@gmail.com> wrote:
>> > > > >
>> > > > > Hello,
>> > > > >
>> > > > > We are facing a strange situation in our application as described
>> > below:
>> > > > >
>> > > > > *Using*:
>> > > > >
>> > > > >    - Python 3.8.10
>> > > > >    - Pylucene 6.5.0
>> > > > >    - Java 8 (1.8.0_181)
>> > > > >    - Runs on Linux and Windows (error seen on Windows)
>> > > > >
>> > > > > We suddenly get the following *error*:
>> > > > >
>> > > > > 2022-02-10 09:58:09.253215: ERROR : writer | Failed to get index
>> > > > > (D:\i\202202) writer, Exception:
>> > > > > org.apache.lucene.index.CorruptIndexException: Unexpected file
>> read
>> > error
>> > > > > while reading index.
>> > > > >
>> > > >
>> >
>> (resource=BufferedChecksumIndexInput(MMapIndexInput(path="D:\i\202202\segments_fo")))
>> > > > >
>> > > > >
>> > > > > After this, no further indexing happens - trying to open the index
>> > for
>> > > > > writing throws the above error - and the index writer does not
>> open.
>> > > > >
>> > > > > FYI, our code contains the following *settings*:
>> > > > >
>> > > > > index_path = "D:\i\202202"
>> > > > > index_directory = FSDirectory.open(Paths.get(index_path))
>> > > > > iconfig = IndexWriterConfig(wrapper_analyzer)
>> > > > > iconfig.setOpenMode(IndexWriterConfig.OpenMode.CREATE_OR_APPEND)
>> > > > > iconfig.setRAMBufferSizeMB(16.0)
>> > > > > writer = IndexWriter(index_directory, iconfig)
>> > > > >
>> > > > >
>> > > > > *Repairing*
>> > > > > We tried 'repairing' the index with the following command / tool:
>> > > > >
>> > > > > java -cp lucene-core-6.5.0.jar:lucene-backward-codecs-6.5.0.jar
>> > > > > org.apache.lucene.index.CheckIndex "D:\i\202202" -exorcise
>> > > > >
>> > > > > This however returns saying "No problems found with the index."
>> > > > >
>> > > > >
>> > > > > *Work around*
>> > > > > We have to manually delete the problematic segment file:
>> > > > > D:\i\202202\segments_fo
>> > > > > after which the application starts again... until the next
>> > corruption. We
>> > > > > can't spot a specific pattern.
>> > > > >
>> > > > >
>> > > > > *Two questions:*
>> > > > >
>> > > > >    1. Can we handle this situation programmatically, so that no
>> > manual
>> > > > >    intervention is needed?
>> > > > >    2. Any reason why we are facing the corruption issue in the
>> first
>> > > > place?
>> > > > >
>> > > > >
>> > > > > Before this we were using Pylucene 4.10 and we didn't face this
>> > problem -
>> > > > > the application logic is the same.
>> > > > >
>> > > > > Also, while the application runs on both Linux and Windows, so
>> far we
>> > > > have
>> > > > > observed this situation only on various Windows platforms.
>> > > > >
>> > > > > Would really appreciate some assistance. Thanks in advance.
>> > > > >
>> > > > > Regards,
>> > > > > Antony
>> > > >
>> > > >
>> > > >
>> > > > --
>> > > > Adrien
>> > > >
>> > > >
>> ---------------------------------------------------------------------
>> > > > To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
>> > > > For additional commands, e-mail: java-user-h...@lucene.apache.org
>> > > >
>> > > >
>> >
>> > ---------------------------------------------------------------------
>> > To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
>> > For additional commands, e-mail: java-user-h...@lucene.apache.org
>> >
>> >
>>
>
 Volume in drive D is APP Data
 Volume Serial Number is 742A-8BB3

 Directory of D:\i\202204

04/23/2022  03:39 PM    <DIR>          .
04/23/2022  03:39 PM    <DIR>          ..
04/23/2022  02:33 AM             2,153 segments_10fj
04/23/2022  02:34 AM             1,633 segments_10fk
04/01/2022  02:34 AM                 0 write.lock
04/21/2022  06:47 PM               405 _10ya.cfe
04/21/2022  06:47 PM       404,514,656 _10ya.cfs
04/21/2022  06:47 PM               421 _10ya.si
04/22/2022  10:52 AM               405 _12ok.cfe
04/22/2022  10:52 AM       408,755,035 _12ok.cfs
04/22/2022  10:52 AM               421 _12ok.si
04/22/2022  01:10 PM               405 _1313.cfe
04/22/2022  01:10 PM       444,171,534 _1313.cfs
04/22/2022  01:10 PM               421 _1313.si
04/22/2022  04:17 PM               405 _13in.cfe
04/22/2022  04:17 PM       445,896,460 _13in.cfs
04/22/2022  04:17 PM               421 _13in.si
04/22/2022  07:18 PM               405 _13wu.cfe
04/22/2022  07:18 PM       449,649,236 _13wu.cfs
04/22/2022  07:18 PM               421 _13wu.si
04/23/2022  12:01 AM               405 _14br.cfe
04/23/2022  12:01 AM       413,394,102 _14br.cfs
04/23/2022  12:01 AM               421 _14br.si
04/23/2022  12:01 AM           151,733 _14bs.cfs
04/23/2022  12:02 AM           103,020 _14bt.cfs
04/23/2022  12:02 AM            87,781 _14bu.cfs
04/23/2022  12:03 AM             9,957 _14bv.cfs
04/23/2022  12:03 AM            62,878 _14bw.cfs
04/23/2022  12:04 AM            16,847 _14bx.cfs
04/23/2022  12:05 AM            10,764 _14by.cfs
04/23/2022  12:06 AM            27,356 _14bz.cfs
04/23/2022  12:07 AM               405 _14c0.cfe
04/23/2022  12:07 AM         1,895,475 _14c0.cfs
04/23/2022  12:07 AM               383 _14c0.si
04/23/2022  12:08 AM            40,965 _14c1.cfs
04/23/2022  12:09 AM            82,949 _14c2.cfs
04/23/2022  12:09 AM            42,307 _14c3.cfs
04/23/2022  12:10 AM            22,726 _14c4.cfs
04/23/2022  12:10 AM            41,847 _14c5.cfs
04/23/2022  12:11 AM            72,802 _14c6.cfs
04/23/2022  12:11 AM            58,799 _14c7.cfs
04/23/2022  12:12 AM            88,753 _14c8.cfs
04/23/2022  12:13 AM             7,563 _14c9.cfs
04/23/2022  12:15 AM            46,191 _14ce.cfs
04/23/2022  12:15 AM            81,462 _14cf.cfs
04/23/2022  12:16 AM            11,370 _14cg.cfs
04/23/2022  12:17 AM            38,332 _14ch.cfs
04/23/2022  12:17 AM            26,131 _14ci.cfs
04/23/2022  12:18 AM            46,270 _14cj.cfs
04/23/2022  12:19 AM            17,319 _14ck.cfs
04/23/2022  12:20 AM            15,684 _14cl.cfs
04/23/2022  12:21 AM            59,915 _14cm.cfs
04/23/2022  12:22 AM             7,005 _14cn.cfs
04/23/2022  12:23 AM           167,162 _14cp.cfs
04/23/2022  12:25 AM           170,016 _14cr.cfs
04/23/2022  12:27 AM           271,712 _14cz.cfs
04/23/2022  12:30 AM           107,240 _14d2.cfs
04/23/2022  12:36 AM           316,753 _14d9.cfs
04/23/2022  12:40 AM               405 _14dd.cfe
04/23/2022  12:40 AM           447,407 _14dd.cfs
04/23/2022  12:40 AM               383 _14dd.si
04/23/2022  12:44 AM           269,327 _14dj.cfs
04/23/2022  12:51 AM           175,784 _14dt.cfs
04/23/2022  12:55 AM               405 _14dz.cfe
04/23/2022  12:55 AM           415,372 _14dz.cfs
04/23/2022  12:55 AM               383 _14dz.si
04/23/2022  12:56 AM           273,924 _14e3.cfs
04/23/2022  01:01 AM           242,125 _14ed.cfs
04/23/2022  01:07 AM               405 _14en.cfe
04/23/2022  01:07 AM           449,616 _14en.cfs
04/23/2022  01:07 AM               421 _14en.si
04/23/2022  01:12 AM               405 _14ex.cfe
04/23/2022  01:12 AM           441,454 _14ex.cfs
04/23/2022  01:12 AM               421 _14ex.si
04/23/2022  01:13 AM            46,465 _14ey.cfs
04/23/2022  01:20 AM               405 _14f7.cfe
04/23/2022  01:20 AM           450,403 _14f7.cfs
04/23/2022  01:20 AM               421 _14f7.si
04/23/2022  01:28 AM               405 _14fh.cfe
04/23/2022  01:28 AM           467,352 _14fh.cfs
04/23/2022  01:28 AM               421 _14fh.si
04/23/2022  01:34 AM           297,338 _14fr.cfs
04/23/2022  01:42 AM           285,794 _14g1.cfs
04/23/2022  01:49 AM           380,954 _14gb.cfs
04/23/2022  01:57 AM               405 _14gl.cfe
04/23/2022  01:57 AM           470,249 _14gl.cfs
04/23/2022  01:57 AM               421 _14gl.si
04/23/2022  02:02 AM               405 _14gv.cfe
04/23/2022  02:02 AM           501,988 _14gv.cfs
04/23/2022  02:02 AM               421 _14gv.si
04/23/2022  02:06 AM               405 _14h5.cfe
04/23/2022  02:06 AM           472,582 _14h5.cfs
04/23/2022  02:06 AM               421 _14h5.si
04/23/2022  02:10 AM               405 _14hf.cfe
04/23/2022  02:10 AM           516,337 _14hf.cfs
04/23/2022  02:10 AM               421 _14hf.si
04/23/2022  02:11 AM            11,267 _14hg.cfs
04/23/2022  02:11 AM            55,151 _14hh.cfs
04/23/2022  02:12 AM            15,565 _14hi.cfs
04/23/2022  02:12 AM            88,140 _14hj.cfs
04/23/2022  02:13 AM            69,674 _14hk.cfs
04/23/2022  02:14 AM            23,462 _14hl.cfs
04/23/2022  02:15 AM            10,100 _14hm.cfs
04/23/2022  02:15 AM            45,510 _14hn.cfs
04/23/2022  02:17 AM               405 _14hp.cfe
04/23/2022  02:17 AM           547,179 _14hp.cfs
04/23/2022  02:17 AM               421 _14hp.si
04/23/2022  02:17 AM             8,662 _14hq.cfs
04/23/2022  02:25 AM               405 _14hz.cfe
04/23/2022  02:25 AM           403,710 _14hz.cfs
04/23/2022  02:25 AM               421 _14hz.si
04/23/2022  02:26 AM            84,866 _14i0.cfs
04/23/2022  02:27 AM            18,715 _14i1.cfs
04/23/2022  02:28 AM             9,221 _14i2.cfs
04/23/2022  02:30 AM            83,826 _14i3.cfs
04/23/2022  02:31 AM            30,755 _14i4.cfs
04/23/2022  02:31 AM            24,798 _14i5.cfs
04/23/2022  02:32 AM            27,181 _14i6.cfs
04/23/2022  02:33 AM             7,005 _14i7.cfs
04/23/2022  02:34 AM               405 _14i9.cfe
04/23/2022  02:34 AM           558,333 _14i9.cfs
04/23/2022  02:34 AM               421 _14i9.si
04/05/2022  11:29 AM                98 _664.dii
04/05/2022  11:29 AM         3,140,765 _664.dim
04/05/2022  11:28 AM        51,797,110 _664.fdt
04/05/2022  11:28 AM            21,038 _664.fdx
04/05/2022  11:29 AM             4,291 _664.fnm
04/05/2022  11:29 AM         1,515,011 _664.nvd
04/05/2022  11:29 AM               188 _664.nvm
04/05/2022  11:29 AM               583 _664.si
04/05/2022  11:29 AM        92,453,822 _664_Lucene50_0.doc
04/05/2022  11:29 AM       234,246,791 _664_Lucene50_0.pos
04/05/2022  11:29 AM       280,169,598 _664_Lucene50_0.tim
04/05/2022  11:29 AM         2,532,894 _664_Lucene50_0.tip
04/05/2022  11:29 AM         2,935,314 _664_Lucene54_0.dvd
04/05/2022  11:29 AM               363 _664_Lucene54_0.dvm
04/19/2022  06:37 PM                95 _yjj.dii
04/19/2022  06:37 PM         1,897,032 _yjj.dim
04/19/2022  06:36 PM        30,489,864 _yjj.fdt
04/19/2022  06:36 PM            12,808 _yjj.fdx
04/19/2022  06:37 PM             4,455 _yjj.fnm
04/19/2022  06:37 PM           913,151 _yjj.nvd
04/19/2022  06:37 PM               188 _yjj.nvm
04/19/2022  06:37 PM               583 _yjj.si
04/19/2022  06:37 PM        52,854,765 _yjj_Lucene50_0.doc
04/19/2022  06:37 PM       133,018,229 _yjj_Lucene50_0.pos
04/19/2022  06:37 PM       178,174,040 _yjj_Lucene50_0.tim
04/19/2022  06:37 PM         1,662,130 _yjj_Lucene50_0.tip
04/19/2022  06:37 PM         1,854,815 _yjj_Lucene54_0.dvd
04/19/2022  06:37 PM               363 _yjj_Lucene54_0.dvm
04/21/2022  10:58 AM                96 _zs9.dii
04/21/2022  10:58 AM         2,181,856 _zs9.dim
04/21/2022  10:58 AM        34,798,219 _zs9.fdt
04/21/2022  10:58 AM            14,990 _zs9.fdx
04/21/2022  10:58 AM             4,291 _zs9.fnm
04/21/2022  10:58 AM         1,060,415 _zs9.nvd
04/21/2022  10:58 AM               188 _zs9.nvm
04/21/2022  10:58 AM               583 _zs9.si
04/21/2022  10:58 AM        57,928,377 _zs9_Lucene50_0.doc
04/21/2022  10:58 AM       147,299,467 _zs9_Lucene50_0.pos
04/21/2022  10:58 AM       181,792,715 _zs9_Lucene50_0.tim
04/21/2022  10:58 AM         1,683,131 _zs9_Lucene50_0.tip
04/21/2022  10:58 AM         1,921,992 _zs9_Lucene54_0.dvd
04/21/2022  10:58 AM               363 _zs9_Lucene54_0.dvm
04/21/2022  11:48 AM                96 _ztz.dii
04/21/2022  11:48 AM         2,124,160 _ztz.dim
04/21/2022  11:48 AM        38,019,415 _ztz.fdt
04/21/2022  11:48 AM            14,928 _ztz.fdx
04/21/2022  11:48 AM             4,291 _ztz.fnm
04/21/2022  11:48 AM         1,031,495 _ztz.nvd
04/21/2022  11:48 AM               188 _ztz.nvm
04/21/2022  11:48 AM               583 _ztz.si
04/21/2022  11:48 AM        64,060,384 _ztz_Lucene50_0.doc
04/21/2022  11:48 AM       163,985,641 _ztz_Lucene50_0.pos
04/21/2022  11:48 AM       229,803,629 _ztz_Lucene50_0.tim
04/21/2022  11:48 AM         2,146,560 _ztz_Lucene50_0.tip
04/21/2022  11:48 AM         1,912,551 _ztz_Lucene54_0.dvd
04/21/2022  11:48 AM               363 _ztz_Lucene54_0.dvm
             176 File(s)  4,580,827,241 bytes
               2 Dir(s)  111,429,218,304 bytes free
---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-user-h...@lucene.apache.org

Reply via email to