Re: [I] While merging quantized vector fields, how can we minimize information loss? [lucene]

via GitHub Thu, 12 Mar 2026 03:42:44 -0700


mikemccand commented on issue #15773:
URL: https://github.com/apache/lucene/issues/15773#issuecomment-4045704684


   Also, I think this was @msokolov idea maybe: simple set-overlap recall is a 
dangerous metric because it doesn't reflect vectors whose scores are really 
close.  So if I get totally different vectors, but their scores were very close 
to the previous set of vectors, that shouldn't really count as 0% recall.  Are 
there known improvements to recall, "score weighted recall" or so, to take this 
into account somehow...?  One workaround is to report recall@N with many N 
values -- once N gets beyond those "nearly the same distance" set of result 
vectors, recall rises, vs the smaller Ns...


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Re: [I] While merging quantized vector fields, how can we minimize information loss? [lucene]

Reply via email to