Michael McCandless created LUCENE-7626: ------------------------------------------
Summary: IndexWriter shouldn't accept broken offsets Key: LUCENE-7626 URL: https://issues.apache.org/jira/browse/LUCENE-7626 Project: Lucene - Core Issue Type: Improvement Reporter: Michael McCandless Assignee: Michael McCandless Fix For: master (7.0) Attachments: LUCENE-7626.patch I think we should do this in 7.0 (not 6.x). Long ago we stopped accepting broken offsets (where the start offset for a token is before the start offset of the last token) in postings (LUCENE-4127), but we are still lenient with term vectors. I think we should also check for term vectors: this would let users know that their analysis chain is producing offsets that cannot be used properly at search time. -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org