[ 
https://issues.apache.org/jira/browse/LUCENE-5267?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Adrien Grand resolved LUCENE-5267.
----------------------------------

    Resolution: Not A Problem

bq. dOff - matchDec <0, so throws java.lang.ArrayIndexOutOfBoundsException
bq. dest.length=33288,dOff=3184,matchDec=34510,matchLen=15,fastLen=16

Indeed, all the lines you pasted make no sense since matchDec should be lower 
than dOff. To me this really looks like your index got corrupted somehow. It 
could be a single corrupt byte that makes LZ4 read a length on 2 bytes instead 
of 1 and this shift makes LZ4 try to decompress bytes that make no sense at 
all, explaining why all matchDecs are all higher than dOff.

There are likely only a few chunks that are broken so if you want to try to get 
back as many documents as possible from the corrupt segment, the following 
piece of code may help https://gist.github.com/jpountz/6461246

> java.lang.ArrayIndexOutOfBoundsException on reading data
> --------------------------------------------------------
>
>                 Key: LUCENE-5267
>                 URL: https://issues.apache.org/jira/browse/LUCENE-5267
>             Project: Lucene - Core
>          Issue Type: Bug
>          Components: core/index
>    Affects Versions: 4.4
>            Reporter: Littlestar
>            Assignee: Adrien Grand
>              Labels: LZ4
>
> java.lang.ArrayIndexOutOfBoundsException
>       at org.apache.lucene.codecs.compressing.LZ4.decompress(LZ4.java:132)
>       at 
> org.apache.lucene.codecs.compressing.CompressionMode$4.decompress(CompressionMode.java:135)
>       at 
> org.apache.lucene.codecs.compressing.CompressingStoredFieldsReader.visitDocument(CompressingStoredFieldsReader.java:336)
>       at 
> org.apache.lucene.index.SegmentReader.document(SegmentReader.java:133)
>       at 
> org.apache.lucene.index.BaseCompositeReader.document(BaseCompositeReader.java:110)
>       at 
> org.apache.lucene.index.SlowCompositeReaderWrapper.document(SlowCompositeReaderWrapper.java:212)
>       at 
> org.apache.lucene.index.FilterAtomicReader.document(FilterAtomicReader.java:365)
>       at 
> org.apache.lucene.index.BaseCompositeReader.document(BaseCompositeReader.java:110)
>       at org.apache.lucene.index.IndexReader.document(IndexReader.java:447)
>       at org.apache.lucene.search.IndexSearcher.doc(IndexSearcher.java:204)



--
This message was sent by Atlassian JIRA
(v6.1#6144)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Reply via email to