[
https://issues.apache.org/jira/browse/LUCENE-4139?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13293554#comment-13293554
]
Robert Muir commented on LUCENE-4139:
-------------------------------------
Looks like we arent summing up offsets correctly for multivalued fields, thus
they go backwards.
I added this assert to the postingswriter:
assert offsetDelta >= 0 && offsetLength >= 0 : "startOffset=" +
startOffset + ",lastOffset=" + lastOffset + ",endOffset=" + endOffset;
[junit4] > Throwable #1: java.lang.AssertionError:
startOffset=26,lastOffset=34,endOffset=29
[junit4] > at
__randomizedtesting.SeedInfo.seed([76B886A04FD18EEC:D9439B78AFF692]:0)
[junit4] > at
org.apache.lucene.codecs.lucene40.Lucene40PostingsWriter.addPosition(Lucene40PostingsWriter.java:255)
> multivalued field with offsets makes corrumpt index
> ----------------------------------------------------
>
> Key: LUCENE-4139
> URL: https://issues.apache.org/jira/browse/LUCENE-4139
> Project: Lucene - Java
> Issue Type: Bug
> Affects Versions: 4.0
> Reporter: Robert Muir
> Attachments: LUCENE-4139_test.patch, LUCENE-4139_test.patch
>
>
> I was trying to beef up TestBackwardsCompatibility (LUCENE-4085) but i
> accidentally made a corrupt index due to a typo:
> {code}
> // a field with both offsets and term vectors for a cross-check
> FieldType customType3 = new FieldType(TextField.TYPE_STORED);
> customType3.setStoreTermVectors(true);
> customType3.setStoreTermVectorPositions(true);
> customType3.setStoreTermVectorOffsets(true);
> customType3.setIndexOptions(IndexOptions.DOCS_AND_FREQS_AND_POSITIONS_AND_OFFSETS);
> doc.add(new Field("content3", "here is more content with aaa aaa aaa",
> customType3));
> // a field that omits only positions
> FieldType customType4 = new FieldType(TextField.TYPE_STORED);
> customType4.setStoreTermVectors(true);
> customType4.setStoreTermVectorPositions(false);
> customType4.setStoreTermVectorOffsets(true);
> customType4.setIndexOptions(IndexOptions.DOCS_AND_FREQS);
> // check out the copy-paste typo here! i forgot to change this to content4
> doc.add(new Field("content3", "here is more content with aaa aaa aaa",
> customType3));
> {code}
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators:
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]