[ https://issues.apache.org/jira/browse/LUCENE-8284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16460819#comment-16460819 ]
Jim Ferenczi commented on LUCENE-8284: -------------------------------------- I agree with Adrien, just exposing multi term queries without limitations is not going to scale and will certainly produce OOME or very slow query. (edge) ngrams indexing should be used for prefixes and infixes matching but it won't work for fuzzy or regex queries so another option would be to accept multi term queries but only if they use top terms rewriting. So it would only select the top terms (and we can limit the number to Boolean.MAX_CLAUSE_COUNT) and translate them into term intervals ? maxExpansions should be a mandatory parameter for this kind of source in order to make sure that users are aware that only a subset of the matching terms are considered. > Add MultiTermsIntervalsSource > ----------------------------- > > Key: LUCENE-8284 > URL: https://issues.apache.org/jira/browse/LUCENE-8284 > Project: Lucene - Core > Issue Type: Improvement > Reporter: Matt Weber > Priority: Minor > Attachments: LUCENE-8284.patch > > > Add support for creating an {{IntervalsSource}} from multi-term expansions > such as wildcards, regular expressions, etc. -- This message was sent by Atlassian JIRA (v7.6.3#76005) --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org