mikemccand commented on issue #15773: URL: https://github.com/apache/lucene/issues/15773#issuecomment-4045704684
Also, I think this was @msokolov idea maybe: simple set-overlap recall is a dangerous metric because it doesn't reflect vectors whose scores are really close. So if I get totally different vectors, but their scores were very close to the previous set of vectors, that shouldn't really count as 0% recall. Are there known improvements to recall, "score weighted recall" or so, to take this into account somehow...? One workaround is to report recall@N with many N values -- once N gets beyond those "nearly the same distance" set of result vectors, recall rises, vs the smaller Ns... -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
