Michael McCandless created LUCENE-7626:
------------------------------------------
Summary: IndexWriter shouldn't accept broken offsets
Key: LUCENE-7626
URL: https://issues.apache.org/jira/browse/LUCENE-7626
Project: Lucene - Core
Issue Type: Improvement
Reporter: Michael McCandless
Assignee: Michael McCandless
Fix For: master (7.0)
Attachments: LUCENE-7626.patch
I think we should do this in 7.0 (not 6.x).
Long ago we stopped accepting broken offsets (where the start offset
for a token is before the start offset of the last token) in postings
(LUCENE-4127), but we are still lenient with term vectors.
I think we should also check for term vectors: this would let users
know that their analysis chain is producing offsets that cannot be
used properly at search time.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]