[
https://issues.apache.org/jira/browse/LUCENE-8284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16460819#comment-16460819
]
Jim Ferenczi commented on LUCENE-8284:
--------------------------------------
I agree with Adrien, just exposing multi term queries without limitations is
not going to scale and will certainly produce OOME or very slow query.
(edge) ngrams indexing should be used for prefixes and infixes matching but it
won't work for fuzzy or regex queries so another option would be to accept
multi term queries but only if they use top terms rewriting.
So it would only select the top terms (and we can limit the number to
Boolean.MAX_CLAUSE_COUNT) and translate them into term intervals ?
maxExpansions should be a mandatory parameter for this kind of source in order
to make sure that users are aware that only a subset of the matching terms are
considered.
> Add MultiTermsIntervalsSource
> -----------------------------
>
> Key: LUCENE-8284
> URL: https://issues.apache.org/jira/browse/LUCENE-8284
> Project: Lucene - Core
> Issue Type: Improvement
> Reporter: Matt Weber
> Priority: Minor
> Attachments: LUCENE-8284.patch
>
>
> Add support for creating an {{IntervalsSource}} from multi-term expansions
> such as wildcards, regular expressions, etc.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]