Michael McCandless created LUCENE-7626:
------------------------------------------

             Summary: IndexWriter shouldn't accept broken offsets
                 Key: LUCENE-7626
                 URL: https://issues.apache.org/jira/browse/LUCENE-7626
             Project: Lucene - Core
          Issue Type: Improvement
            Reporter: Michael McCandless
            Assignee: Michael McCandless
             Fix For: master (7.0)
         Attachments: LUCENE-7626.patch

I think we should do this in 7.0 (not 6.x).

Long ago we stopped accepting broken offsets (where the start offset
for a token is before the start offset of the last token) in postings
(LUCENE-4127), but we are still lenient with term vectors.

I think we should also check for term vectors: this would let users
know that their analysis chain is producing offsets that cannot be
used properly at search time.





--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Reply via email to