[ 
https://issues.apache.org/jira/browse/LUCENE-8284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16460819#comment-16460819
 ] 

Jim Ferenczi commented on LUCENE-8284:
--------------------------------------

I agree with Adrien, just exposing multi term queries without limitations is 
not going to scale and will certainly produce OOME or very slow query.
(edge) ngrams indexing should be used for prefixes and infixes matching but it 
won't work for fuzzy or regex queries so another option would be to accept 
multi term queries but only if they use top terms rewriting.
So it would only select the top terms (and we can limit the number to 
Boolean.MAX_CLAUSE_COUNT) and translate them into term intervals ? 
maxExpansions should be a mandatory parameter for this kind of source in order 
to make sure that users are aware that only a subset of the matching terms are 
considered.

> Add MultiTermsIntervalsSource
> -----------------------------
>
>                 Key: LUCENE-8284
>                 URL: https://issues.apache.org/jira/browse/LUCENE-8284
>             Project: Lucene - Core
>          Issue Type: Improvement
>            Reporter: Matt Weber
>            Priority: Minor
>         Attachments: LUCENE-8284.patch
>
>
> Add support for creating an {{IntervalsSource}} from multi-term expansions 
> such as wildcards, regular expressions, etc.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Reply via email to