[
https://issues.apache.org/jira/browse/LUCENE-7822?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16019043#comment-16019043
]
Martin Amirault commented on LUCENE-7822:
-----------------------------------------
{quote}
Maybe I'm missing something but you still need a ChecksumIndexInput since
checkFooter needs one?
{quote}
In the fix you suggested checkFooter was moved out of the
{{org.apache.lucene.index.SegmentInfos#readCommit(Directory,
ChecksumIndexInput, long)}} method, in which case parameter type could be
changed as {{IndexInput}}.
{quote}
This would add the CorruptIndexException as a suppressed exception to the IAE
so that it is clear this exception was caused by a corruption? It would also be
aligned with eg. how Lucene70SegmentInfoFormat (which also calls
Version.fromBits) handles unexpected exceptions
{quote}
>From my point of view if the checksum do not match, a
>{{CorruptIndexException}} should be thrown, with exceptions possibly resulting
>from it set as suppressed exceptions, eg behaves as if the file checksum was
>done before actually parsing the file content.
In that case if the {{si}} file is small reading it twice is the easiest: first
time for checksum using {{CodecUtil.checksumEntireFile}} as originally
suggested, then parse the content.
I agree this is not efficient, maybe the behavior of {{checkFooter}} method
could be changed ? :
* behave as current one if the {{Throwable}} is not an {{Exception}}
* Otherwise throws a {{CorruptIndexException}} with original exception add as a
suppressed one
Anyway I do not have an overview of the whole source code so I let you have the
last word :)
> IllegalArgumentException thrown instead of a CorruptIndexException
> ------------------------------------------------------------------
>
> Key: LUCENE-7822
> URL: https://issues.apache.org/jira/browse/LUCENE-7822
> Project: Lucene - Core
> Issue Type: Bug
> Affects Versions: 6.5.1
> Reporter: Martin Amirault
> Priority: Minor
> Attachments: LUCENE-7822.patch
>
>
> Similarly to LUCENE-7592 , When an {{*.si}} file is corrupted on very
> specific part an IllegalArgumentException is thrown instead of a
> CorruptIndexException.
> StackTrace (Lucene 6.5.1):
> {code}
> java.lang.IllegalArgumentException: Illegal minor version: 12517381
> at
> __randomizedtesting.SeedInfo.seed([1FEB5987CFA44BE:B8755B5574F9F3BF]:0)
> at org.apache.lucene.util.Version.<init>(Version.java:385)
> at org.apache.lucene.util.Version.<init>(Version.java:371)
> at org.apache.lucene.util.Version.fromBits(Version.java:353)
> at
> org.apache.lucene.codecs.lucene62.Lucene62SegmentInfoFormat.read(Lucene62SegmentInfoFormat.java:97)
> at
> org.apache.lucene.index.SegmentInfos.readCommit(SegmentInfos.java:357)
> at
> org.apache.lucene.index.SegmentInfos.readCommit(SegmentInfos.java:288)
> at org.apache.lucene.index.SegmentInfos$1.doBody(SegmentInfos.java:448)
> at org.apache.lucene.index.SegmentInfos$1.doBody(SegmentInfos.java:445)
> at
> org.apache.lucene.index.SegmentInfos$FindSegmentsFile.run(SegmentInfos.java:692)
> at
> org.apache.lucene.index.SegmentInfos$FindSegmentsFile.run(SegmentInfos.java:644)
> at
> org.apache.lucene.index.SegmentInfos.readLatestCommit(SegmentInfos.java:450)
> at
> org.apache.lucene.index.DirectoryReader.listCommits(DirectoryReader.java:260)
> {code}
> Simple fix would be to add IllegalArgumentException to the catch list at
> {{org/apache/lucene/index/SegmentInfos.java:289}}
> Other variations for the stacktraces:
> {code}
> java.lang.IllegalArgumentException: invalid codec filename '_�.cfs', must
> match: _[a-z0-9]+(_.*)?\..*
> at
> __randomizedtesting.SeedInfo.seed([8B3FDE317B8D634A:A8EE07E5EB4B0B13]:0)
> at
> org.apache.lucene.index.SegmentInfo.checkFileNames(SegmentInfo.java:270)
> at org.apache.lucene.index.SegmentInfo.addFiles(SegmentInfo.java:252)
> at org.apache.lucene.index.SegmentInfo.setFiles(SegmentInfo.java:246)
> at
> org.apache.lucene.codecs.lucene62.Lucene62SegmentInfoFormat.read(Lucene62SegmentInfoFormat.java:248)
> at
> org.apache.lucene.index.SegmentInfos.readCommit(SegmentInfos.java:357)
> at
> org.apache.lucene.index.SegmentInfos.readCommit(SegmentInfos.java:288)
> at org.apache.lucene.index.SegmentInfos$1.doBody(SegmentInfos.java:448)
> at org.apache.lucene.index.SegmentInfos$1.doBody(SegmentInfos.java:445)
> at
> org.apache.lucene.index.SegmentInfos$FindSegmentsFile.run(SegmentInfos.java:692)
> at
> org.apache.lucene.index.SegmentInfos$FindSegmentsFile.run(SegmentInfos.java:644)
> at
> org.apache.lucene.index.SegmentInfos.readLatestCommit(SegmentInfos.java:450)
> at
> org.apache.lucene.index.DirectoryReader.listCommits(DirectoryReader.java:260)
> {code}
> {code}
> java.lang.IllegalArgumentException: An SPI class of type
> org.apache.lucene.codecs.Codec with name 'LucenI62' does not exist. You need
> to add the corresponding JAR file supporting this SPI to your classpath. The
> current classpath supports the following names: [Lucene62, Lucene50,
> Lucene53, Lucene54, Lucene60]
> at
> __randomizedtesting.SeedInfo.seed([925DE160F7260F99:B026EB9373CB6368]:0)
> at org.apache.lucene.util.NamedSPILoader.lookup(NamedSPILoader.java:116)
> at org.apache.lucene.codecs.Codec.forName(Codec.java:116)
> at org.apache.lucene.index.SegmentInfos.readCodec(SegmentInfos.java:424)
> at
> org.apache.lucene.index.SegmentInfos.readCommit(SegmentInfos.java:356)
> at
> org.apache.lucene.index.SegmentInfos.readCommit(SegmentInfos.java:288)
> at org.apache.lucene.index.SegmentInfos$1.doBody(SegmentInfos.java:448)
> at org.apache.lucene.index.SegmentInfos$1.doBody(SegmentInfos.java:445)
> at
> org.apache.lucene.index.SegmentInfos$FindSegmentsFile.run(SegmentInfos.java:692)
> at
> org.apache.lucene.index.SegmentInfos$FindSegmentsFile.run(SegmentInfos.java:644)
> at
> org.apache.lucene.index.SegmentInfos.readLatestCommit(SegmentInfos.java:450)
> at
> org.apache.lucene.index.DirectoryReader.listCommits(DirectoryReader.java:260)
> {code}
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]