On Tue, Apr 26, 2022 at 8:40 AM Michael McCandless <[email protected]> wrote: > > But then I don't think our testing of too-long terms, which happens to real > users, is great. We have a dedicated unit test case > (TestIndexWriter.testWickedLongTerm) which specifically confirms that the > inverted index will be OK (and throw the right exception) if you attempt to > index a massive term. But what about all our analyzers? Do they handle > too-long terms? Does TestRandomChains sometimes inject massive terms? Or > our random realistic Unicode string generation methods? >
Hi Mike, I don't think this is the only unit test for indexwriter for this situation. There is also a whole dedicated class: https://github.com/apache/lucene/blob/main/lucene/core/src/test/org/apache/lucene/index/TestExceedMaxTermLength.java --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
