Thanks for your follow up , we are using NRT replicas

On 12/17/25, 11:01 AM, "Andrey Ukhanov (BLOOMBERG/ 919 3RD A)" 
<[email protected] <mailto:[email protected]>> wrote:


[You don't often get email from [email protected] 
<mailto:[email protected]>. Learn why this is important at 
https://aka.ms/LearnAboutSenderIdentification 
<https://aka.ms/LearnAboutSenderIdentification> ]


Hi Rajeswari, what replication model are you using in Solr? NRT or TLOG/PULL?


From: [email protected] <mailto:[email protected]> At: 12/17/25 
13:59:48 UTC-5:00To: [email protected] <mailto:[email protected]>
Cc: [email protected] <mailto:[email protected]>
Subject: Search results consistency with vector search


Hi All,


Noticed that the vector search results for the same query is different each
time. Both ordering and the records are also different based on which replica
it hits.


All the replicas have same documents and all of them have same embeddings. With
vector similarity parser with minReturn=0.8 , minTraversse=0.8 , the numFound
for specific query varies from 111 to 8 , which is a huge variation.


We are using solr 9.9 and lucene version 9.12.2. I believe this behavior due
to approximate HNSW construction in each replica.




Tried with minTraverseas 0.75 instead 0.8 , this fetches more records
(somewhere in 800s) he variations in numFound is less , but the ordering of
the records and even the record is different in this case also each time.
Is this expected ? . What can be done to get consistent results each time.
Please share your experiences.




Thanks,
Rajeswari







Reply via email to