On Tue, Apr 26, 2022 at 8:40 AM Michael McCandless <[email protected]> wrote:
>
> But then I don't think our testing of too-long terms, which happens to real
> users, is great. We have a dedicated unit test case
> (TestIndexWriter.testWickedLongTerm) which specifically confirms that the
> inverted index will be OK (and throw the right exception) if you attempt to
> index a massive term. But what about all our analyzers? Do they handle
> too-long terms? Does TestRandomChains sometimes inject massive terms? Or
> our random realistic Unicode string generation methods?
>
Analyzers typically have a "testRandomHugeStrings()" in addition to "testRandom()". It uses huge strings but fewer iterations of the test (to keep the run time down). And yes, this is the same tester method that TestRandomChains uses.
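
To make the trade-off concrete, here is a rough stand-alone sketch of the pattern: one shared checker, called once with many iterations of short inputs and once with far fewer iterations of huge inputs. Nothing in it is Lucene code. The toy tokenizer, the 255-char cap, the iteration counts, and the names HugeStringFuzzSketch, tokenize, randomText and checkRandomInputs are all made up for illustration.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Random;

/** Stand-alone illustration only; none of these names are Lucene APIs. */
final class HugeStringFuzzSketch {

    /** Hypothetical per-token length cap for this toy tokenizer. */
    static final int MAX_TOKEN_LENGTH = 255;

    /** Toy whitespace tokenizer that splits over-long words into chunks of at most MAX_TOKEN_LENGTH chars. */
    static List<String> tokenize(String text) {
        List<String> tokens = new ArrayList<>();
        for (String word : text.split("\\s+")) {
            for (int start = 0; start < word.length(); start += MAX_TOKEN_LENGTH) {
                int end = Math.min(word.length(), start + MAX_TOKEN_LENGTH);
                tokens.add(word.substring(start, end));
            }
        }
        return tokens;
    }

    /** Random lowercase text of length 0..maxLength with rare spaces, so long inputs contain very long words. */
    static String randomText(Random random, int maxLength) {
        int length = random.nextInt(maxLength + 1);
        StringBuilder sb = new StringBuilder(length);
        for (int i = 0; i < length; i++) {
            // Roughly 1 in 500 chars is a space.
            sb.append(random.nextInt(500) == 0 ? ' ' : (char) ('a' + random.nextInt(26)));
        }
        return sb.toString();
    }

    /** Shared checker: every token must respect the cap and the tokens must cover the input. */
    static int checkRandomInputs(Random random, int iterations, int maxLength) {
        for (int i = 0; i < iterations; i++) {
            String text = randomText(random, maxLength);
            StringBuilder rebuilt = new StringBuilder();
            for (String token : tokenize(text)) {
                if (token.isEmpty() || token.length() > MAX_TOKEN_LENGTH) {
                    throw new AssertionError("bad token length " + token.length());
                }
                rebuilt.append(token);
            }
            if (!rebuilt.toString().equals(text.replace(" ", ""))) {
                throw new AssertionError("tokens do not cover the input");
            }
        }
        return iterations;
    }

    public static void main(String[] args) {
        Random random = new Random(42L);
        // "testRandom"-style: many iterations, short inputs.
        checkRandomInputs(random, 1000, 20);
        // "testRandomHugeStrings"-style: far fewer iterations, much longer inputs.
        checkRandomInputs(random, 20, 100_000);
        System.out.println("ok");
    }
}
```

The point is only that both tests go through the same checking method and differ in the iteration count and the maximum input length, which is why the huge-string variant can afford far fewer rounds.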
