Re: Semantics and performance regarding min number of the optional BooleanClauses

2020-03-30 Thread Stamatis Zampetakis
Thanks Adrien, indeed I missed the difference in the score since in my context I do not need them most of the time. I will try to check if there is something that can be done with respect to the rewriting and if something promising comes up I will let you know. Best, Stamatis On Mon, Mar 30, 20

Re: Semantics and performance regarding min number of the optional BooleanClauses

2020-03-30 Thread Adrien Grand
Hi Stamatis, One thing that you missed regarding semantics is scoring. While (A B C)~2 and ((+A +B) (+A +C) (+B +C)) would match the same documents, they would produce different scores. Moreover, many users come to this query because it is exactly what they need: matching k out of n clauses. In t

Semantics and performance regarding min number of the optional BooleanClauses

2020-03-30 Thread Stamatis Zampetakis
Hi all, My question concerns the method setMinimumNumberShouldMatch in BooleaQuery class. Lets assume that we have 3 queries (optional clauses), namely A, B, C and we build a BooleanQuery specifying that at least 2 should match. In terms of semantics what I understand so far is that (A B C)~2 i