Re: Searching within very large subset of documents

2025-08-07 Thread Adrien Grand
Glad to hear it! Le mer. 6 août 2025, 22:32, Thomas Barr a écrit : > For what it's worth, TermInSetQuery worked great! Thanks! > > > On Aug 4, 2025, at 11:26 PM, Adrien Grand wrote: > > > > Hi Thomas, > > > > Your question suggests that you are creating a huge BooleanQuery to > > identify these

Re: Searching within very large subset of documents

2025-08-05 Thread Thomas Barr
The TermInSetQuery query might be just what I need… Essentially, I’m trying to allow a user to provide a file listing a subset of documents with an arbitrary document numbers, stored as a term in the index. After that, I want them to interactively make searches within that subset. Merci, -twb

Re: Searching within very large subset of documents

2025-08-04 Thread Adrien Grand
Hi Thomas, Your question suggests that you are creating a huge BooleanQuery to identify these documents. A TermInSetQuery should perform better. Doing better would require to better understand what you are trying to achieve. For instance if you end up with such a large list of terms because you'r

Searching within very large subset of documents

2025-08-04 Thread Thomas Barr
I have a medium-sized (~10m) Lucene index and I frequently want to repeatedly search within a subset of around ~100k documents. I can increase MaxClauseCount and build up a huge TermQuery, keep that around, then build a BooleanQuery out of the result at runtime, but the resulting query is quite